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in continuous time; and ii) to review some of the related problems to which physicists have made relevant 
contributions in recent years. 



1 Introduction 

This article comprises the set of notes for the short course 
"Introduction to Econophysics," delivered at the II Brazilian 
School on Statistical Mechanics, held at the University of 
Sao Paulo, in Sao Carlos, SP, Brazil, in February 2004. The 
course consisted of five lectures and was aimed at physics 
graduate students with no previous exposition to the subject. 

The main goal of the course was twofold: i) to provide 
a brief introduction to the basic models for pricing financial 
derivatives; and ii) to review some of the related problems in 
Finance to which physicists have made significant contribu- 
tions over the last decade. The recent body of work done by 
physicists and others have produced convincing evidences 
that the standard model of Finance (see below) is not fully 
capable of describing real markets, and hence new ideas and 
models are called for, some of which have come straight 
from Physics. In selecting some of the more recent work 
done by physicists to discuss here, I have tried to restrict 
myself to problems that may have a direct bear on models 
for pricing derivatives. And even in such cases only a brief 
overview of the problems is given. It should then be empha- 
sized that these notes are not intended as a review article on 
Econophysics, which is nowadays a broad interdisciplinary 
area, but rather as a pedagogical introduction to the math- 
ematics (and physics?) of financial derivatives. Hence no 
attempt has been made to provide a comprehensive list of 
references. 

No claim of originality is made here regarding the con- 
tents of the present notes. Indeed, the basic theory of finan- 
cial derivatives can now be found in numerous textbooks, 
written at a different mathematical levels and aiming at spe- 
cific (or mixed) audiences, such as economists [1, 2, 3, 4], 
applied mathematicians [5, 6, 7, 8], physicists [9, 10, 11], 
etc. (Here I have listed only the texts that were most often 
consulted while writing these notes.) Nevertheless, some 
aspects of presentation given here have not, to my knowl- 
edge, appeared before. An example is the analogy between 
market efficiency and a certain symmetry principle that is 
put forward in Sec. V. Similarly, the discussion of some of 



the more recent research problems is based on the already 
published literature. An exception is Fig. 12 which contains 
unpublished results obtained by R. L. Costa and myself. 

The present notes are organized as follows. Section 
II gives some basic notions of Finance, intended to intro- 
duce the terminology as well as the main problems that I 
shall be considering. In Sec. Ill, I discuss the Brownian 
motion, under a more formal viewpoint than most Physics 
graduate students are perhaps familiar with, and then de- 
velop the so-called Ito stochastic calculus. Section IV con- 
tains what is the raison d'etre of the present notes, the 
Black-Scholes model for pricing financial derivatives. In 
Sec. V, the martingale approach for pricing derivatives is 
introduced. In particular, I recast the notions of market ef- 
ficiency and no-arbitrage as a 'symmetry principle' and its 
associated 'conservation law.' Sections VI and VII discuss 
two possible ways in which real markets may deviate from 
the standard Black-Scholes model. The first of such possi- 
bilities is that financial asset prices have non-Gaussian dis- 
tributions (Sec. VI), while the second one concerns the pres- 
ence of long-range correlations or memory effects in finan- 
cial data (Sec. VII). Conclusions are presented in Sec. VIII. 
For completeness, I give in Appendix A the formal defini- 
tions of probability space, random variables, and stochastic 
processes. 

2 Basic Notions of Finance 

2.1 Riskless and risky financial assets 

Suppose you deposit at time t = an amount of R$ 1 into a 
bank account that pays an interest rate r. Then over time the 
amount of money you have in the bank, let us call it B(t), 
will increase at a rate 



Solving this equation subject to the initial condition B(0) = 
1 yields 

B{t) = e rt . (2) 
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A bank account is an example of a riskless financial 
assets, since you are guaranteed to receive a known (usu- 
ally fixed) interest rate r, regardless of the market situation. 
Roughly speaking, the way banks operate is that they borrow 
from people who have money to 'spare', but are not willing 
to take risks, and lend (at higher interest rates) to people 
who 'need' money, say, to invest in some risky enterprise. 
By diversifying their lending, banks can reduce their overall 
risk, so that even if some of these loans turn bad they can 
still meet their obligations to the investors from whom they 
borrowed. 

Governments and private companies can also borrow 
money from investors by issuing bonds. Like a bank ac- 
count, a bond pays a (fixed or floating) interest rate on a reg- 
ular basis, the main difference being that the repayment of 
the loan occurs only at a specified time, called the bond ma- 
turity. Another difference is that bonds are not strictly risk- 
free assets because there is always a chance that the bond 
issuer may default on interest payments or (worse) on the 
principal. However, since governments have a much lower 
risk to default than corporations, certain government bonds 
can be considered to be risk free. 

A company can also raise capital by issuing stocks or 
shares. Basically, a stock represents the ownership of a 
small piece of the company. By selling many such 'small 
pieces', a company can raise capital at lower costs than if it 
were to borrow from a bank. As will be discussed shortly, 
stocks are risky financial assets because their prices are sub- 
jected to unpredictable fluctuations. In fact, this is what 
makes stocks attractive to aggressive investors who seek to 
profit from the price fluctuations by pursuing the old advice 
to "buy low and sell high." 

The buying and selling of stocks are usually done in or- 
ganized exchanges, such as, the New York Stock Exchange 
(NYSE) and the Sao Paulo Stock Exchange (BOVESPA). 
Most stock exchanges have indexes that represent some sort 
of average behavior of the corresponding market. Each in- 
dex has its own methodology. For example, the Dow Jones 
Industrial Average of the NYSE, which is arguably the most 
famous stock index, corresponds to an average over 30 in- 
dustrial companies. The Ibovespa index of the Sao Paulo 
Stock Exchange, in contrast, represents the present value of 
a hypothetical portfolio made up of the stocks that altogether 
correspond to 80% of the trading volume. Another well 
known stock index is the Standard & Poor's 500 (S&P500) 
Index calculated on the basis of data about 500 companies 
listed on the NYSE. [Many other risky financial assets, such 
as, currency exchange rates, interest rates, and commodities 
(precious metals, oil, grains, etc), are traded on organized 
markets but these will not be discussed any further in the 
present notes.] 

2.2 The random nature of stock prices 

Since a stock represents a 'small piece' of a company, the 
stock price should somehow reflect the overall value (net 
worth) of this company. However, the present value of a firm 
depends not only on the firm's current situation but also on 
its future performance. So here one sees already the basic 



problem in pricing risky financial assets: we are trying to 
predict the future on the basis of present information. Thus, 
if a new information is revealed that might in one way or 
another affect the company's future performance, then the 
stock price will vary accordingly. It should therefore be 
clear from this simple discussion that the future price of a 
stock will always be subjected to a certain degree of uncer- 
tainty. This is reflected in the typical 'erratic behavior' that 
stock prices show when graphed as a function of time. An 
example of such a graph is shown in Fig. 1 for the case of 
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Figure 1 . Daily closing values of the deflated Ibovespa index in the 
period 1968-2003. 

Although stock prices may vary in a rather unpredictable 
way, this does not mean that they cannot be modeled. It says 
only that they should be described in a probabilistic fashion. 
To make the argument a little more precise, let S be the price 
of a given stock and suppose we want to write an equation 
analogous to (1) for the rate of increase of S: 

f (3) 

where R(t) represents the 'rate of return' of the stock. The 
question then is: what is R(t)l From our previous discus- 
sion, it is reasonable to expect that R(t) could be separated 
into two components: i) a predictable mean rate of return, 
to be denoted by pi, and ii) a fluctuating ('noisy') term 
responsible for the randomness or uncertainty in the stock 
price. Thus, after writing R(t) = pi + £(t) in (3) we have 

f = \t*+at)]s. (4) 

Now, one of the best models for 'noise' is, of course, the 
white noise, so it should not come as a surprise to a physi- 
cist that Brownian motion and white noise play an important 
role in finance, as will be discussed in detail shortly. 

2.3 Options and derivatives 

Besides the primary financial assets already mentioned 
(stocks, commodities, exchange rate, etc), many other fi- 
nancial instruments, such as options and futures contracts, 
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are traded on organized markets (exchanges). These secu- 
rities are generically called derivatives, because they derive 
their value from the price of some primary underlying as- 
set. Derivatives are also sometimes referred to as contingent 
claims, since their values are contingent on the evolution of 
the underlying asset. In the present notes, I will discuss only 
one of the most basic derivatives, namely, options. 

An option is a contract that gives its holder the right, but 
not the obligation, to buy or sell a certain asset for a speci- 
fied price at some future time. The other part of the contract, 
the option underwriter, is obliged to sell or buy the asset at 
the specified price. The right to buy (sell) is called a call 
{put) option. If the option can only be exercised at the future 
date specified in the contract, then it is said to be a Euro- 
pean option. American options, on the other hand, can be 
exercised at any time up to maturity. (For pedagogical rea- 
sons, only European derivatives will be considered here.) To 
establish some notation let us give a formal definition of a 
European option. 

Definition 1 A European call option with exercise price ( or 
strike price) K and maturity (or expiration date) T on the 
underlying asset S is a contract that gives the holder the 
right to buy the underlying asset for the price K at time T. 

A European put option is the same as above, the only 
difference being that it gives the holder the right to sell the 
underlying asset for the exercise price at the expiration date. 

If at the expiration date T the stock price St is above the 
strike price K, the holder of a call option will exercise his 
right to buy the stock from the underwriter at price K and 
sell it in the market at the spot price St, pocketing the dif- 
ference St — K- On the other hand, if at expiration the price 
St closes below K then the call option becomes worthless 
(since it would be cheaper to buy the stock in the market). 
The payoff of a call option at maturity is therefore given by 

payoff caU = max(S T - K, 0). (5) 
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Figure 2. Value of a call option at the expiration date (thick line) 
and before expiration (thin line). 

The payoff diagram of a call option is illustrated by the thick 
line in Fig. 2. In this figure the thin line represents the price 
of the call option at an arbitrary time t < T before expira- 
tion. (The otpion price before expiration is always greater 
than the payoff at expiration on account of the higher risks: 



the further way the expiration date, the greater the uncer- 
tainty regarding the stock price at expiration.) Similarly, the 
payoff function for a put option is 

Payoff put = max(.K - S T , 0), (6) 
which is shown as the thick line in Fig. 3. 

P 
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Figure 3. Value of a put option at the expiration date (thick line) 
and before expiration (thin line). 

Because an option entitles the holder to a certain right 
it commands a premium. Said differently, since the under- 
writer has an obligation (while the holder has only rights) 
he will demand a payment, to be denoted by C , from the 
holder in order to enter into such a contract. Thus, in the 
case of a call option, if the option is exercised the holder 
(underwriter) will make a profit (loss) given by max(S' — 
K, 0) — Co; otherwise, the holder (underwriter) will have 
lost (won) the amount Co paid (received) for the option. And 
similarly for a put option. Note then that the holder and the 
underwriter of an option have opposite views regarding the 
direction of the market. For instance, the holder of a call 
option is betting that the stock price will increase (past the 
exercise price), whereas the underwriter hopes for the oppo- 
site. 

Now, given that the holder and the underwriter have op- 
posite views as to the direction of the market, how can they 
possibly agree on the price for the option? For if the holder 
(underwriter) suspects that the option is overvalued (under- 
valued) he will walk away from the contract. The central 
problem in option pricing is therefore to determine the ra- 
tional price price Co that ensures that neither part 'stands a 
better chance to win.' 

A solution to this problem (under certain assumptions) 
was given in 1973 in the now-famous papers by Black and 
Scholes [12] and Merton [13], which won Scholes and Mer- 
ton the Nobel prize in Economics in 1997. (Black had died 
meanwhile.) The history of options is however much longer. 
In fact, the first scientific study of options dates back to 
the work by the French mathematician Bachelier in 1900 
[14], who solved the option pricing problem above but un- 
der slightly wrong assumptions; see, e.g., [11] for a detailed 
discussion of Bachelier's work. 

After an option (traded on exchange) is first underwrit- 
ten, it can subsequently be traded and hence its 'market 
price' will be determined by the usual bid-ask auction. It is 
nonetheless important to realize that investors in such highly 
specialized market need some basic pricing theory to rely 
on, otherwise investing in options would be a rather wild 
(and dangerous) game. Indeed, only after the appearance 
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of the Black-Scholes model [and the establishment of the 
first option exchange in Chicago also in 1973] have option 
markets thrived. One of the main objectives of the present 
notes is to explain the theoretical framework, namely, the 
Black-Scholes model and some of its extensions, in which 
options and other derivatives are priced. I will therefore not 
say much about the practical aspects of trading with options. 

2.4 Hedging, speculation, and arbitrage 

Investors in derivative markets can be classified into three 
main categories: hedge rs, speculators, and arbitrageurs. 

Hedgers are interested in using derivatives to reduce the 
risk they already face in their portfolio. For example, sup- 
pose you own a stock and are afraid that its price might go 
down within the next months. One possible way to limit 
your risk is to sell the stock now and put the money in a 
bank account. But then you won't profit if the market goes 
up. A better hedging strategy would clearly be to buy a put 
option on the stock, so that you only have to sell the stock 
if it goes below a certain price, while getting to keep it if 
the price goes up. In this case an option works pretty much 
as an insurance: you pay a small price (the option premium 
Co) to insure your holdings against possibly high losses. 

Speculators, in contrast to hedgers, seek to make profit 
by taking risks. They 'take a position' in the market, by bet- 
ting that the price on a given financial asset will go either 
up or down. For instance, if you think that a certain stock 
will go up in the near future, you could "buy and hold" the 
stock in the hope of selling it later at a profit. But then there 
is the risk that the price goes down. A better strategy would 
thus be to buy a call option on the stock. This not only is 
far cheaper than buying the stock itself but also can yield 
a much higher return on your initial investment. (Why?) 
However, if the market does not move in the way you ex- 
pected and the option expire worthless, you end up with a 
100% loss. (That's why speculating with option is a very 
risky business.) 

Arbitrageurs seek to make a riskless profit by entering 
simultaneously into transactions in two or more markets, 
usually without having to make any initial commitment of 
money. The possibility of making a riskless profit, starting 
with no money at all, is called an arbitrage opportunity or, 
simply, an arbitrage. A more formal definition of arbitrage 
will be given later. For the time being, it suffices to give an 
example of how an arbitrage opportunity may arise. 

But before going into this example, it is necessary first 
to discuss the notion of a short sell. 'Shorting' means selling 
an asset that one does not own. For example, if you place an 
order to your broker to short a stock, the broker will "bor- 
row" a stock from somebody else's account, sell it in the 
market, and credit the proceeds into your account. When 
you then decide to close your short position (there usually is 
a limit on how long an asset can be held short), your broker 
will buy the stock in the market (taking the money from your 
account) and return it to its original owner. If in the mean- 
time the stock prices decreased, the short sell brings a profit, 
otherwise the short seller incurs in a loss. This is why a 
short sell is usually done simultaneously with another oper- 



ation to compensate for this risk (as in the arbitrage example 
below). It should also be noted, in passing, that buying the 
actual asset corresponds to taking a 'long position' on this 
asset. 

Let us now consider our hypothetical arbitrage example. 
Many Brazilian companies listed in the Sao Paulo Stock Ex- 
change also have their stocks traded on the New York Stock 
Exchange in the form of the so-called American Deposi- 
tory Receipt (ADR). Suppose then that a stock is quoted in 
Sao Paulo at R$ 100, with its ADR counterpart trading in 
New York at US$ 34, while the currency rate exchange is 
1 USD = 2.90 BRL. Starting with no initial commitment, 
an arbitrageur could sell short N stocks in Sao Paulo and 
use the proceeds to buy N ADR's in New York (and later 
have them transferred to Sao Paulo to close his short po- 
sition). The riskless profit in such operation would be R$ 
(100 - 2.90 x 34)7V = R$ 1.40 N. (In practice, the trans- 
action costs would eliminate the profit for all but large insti- 
tutional investors [1].) 

Note, however, that such 'mispricing' cannot last long: 
buy orders in New York will force the ADR price up, while 
sell orders in Sao Paulo will have the opposite effect on the 
stock price, so that an equilibrium price for both the ADR 
and the stock is soon reached, whereupon arbitrage will no 
longer be possible. In this sense, the actions of an arbi- 
trageur are self-defeating, for they tend to destroy the very 
arbitrage opportunity he is acting upon — but before this hap- 
pens a lot of money can be made. Since there are many 
people looking for such riskless chances to make money, a 
well-functioning market should be free of arbitrage. This is 
the main idea behind the principle that in an efficient mar- 
ket there is no arbitrage, which is commonly known as the 
"no-free-lunch" condition. 

2.5 The no-arbitrage principle in a (binomial) 
nutshell 

Here we shall consider a one-step binomial model to illus- 
trate the principle of no-arbitrage and how it can be used 
to price derivatives. Suppose that today's price of an ordi- 
nary Petrobras stocks (PETR3 in their Bovespa acronym) is 
Sq = 57 BRL. Imagine then that in the next time-period, 
say, one month, the stock can either go up to S*" = 65 with 
probability p or go down to Sf = 53 with probability q. For 
simplicity let us take p = q = 1/2. Our binomial model for 
the stock price dynamics is illustrated in Fig. 4. Note that 
in this case the stock mean rate of return, [i, is given by the 
expression: (1 +p)So = E[Si], where E[S] denotes the ex- 
pected value of S (see Sec. Ill A for more on this notation). 
Using the values shown in Fig. 4, one then gets fi — 0.035 
or (j, = 3.5%. Let us also assume that the risk-free interest 
rate is r — 0.6% monthly. 

Consider next a call option on PETR3 with exercise 
price K = 57 and expiration in the next time period, i.e., 
T = 1. Referring to (5) and Fig. 4, one immediately sees 
that at expiration the possible values (payoffs) for this op- 
tion in our binomial model are as shown in Fig. 5: C" = 8 
or Cf — with equal probability. The question then is to 
determine the 'rational' price Co that one should pay for the 
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option. Below we will solve this problem using two differ- 
ent but related methods. The idea here is to illustrate the 
main principles involved in option pricing, which will be 
generalized later for the case of continuous time. 



S = 57 




Figure 4. One-step binomial model for a stock. 




Figure 6. Delta-hedging portfolio in the one-step binomial model. 

First, we describe the so-called delta-hedging argument. 
Consider a portfolio made up of one option C and a short 
position on A stocks, where A is to be determined later, and 
let Vt denote the money value of such a portfolio at time t. 
We thus have 

V t = C t -AS t , 

where the minus sign denotes that we have short sold A 
stocks (i.e., we 'owe' A stocks in the market). From Figs. 4 
and 5, one clearly sees that the possibles values for this port- 
folio in our one-step model are as illustrated in Fig. 6. Let 
us now chose A such that the value V\ of the portfolio is 
the same in both 'market situations.' Referring to Fig. 6 one 
immediately finds 

VJ* = Vf => 8 - A • 65 = -A • 53 => A = \. 

Thus, by choosing A = 2/3 we have completely eliminated 
the risk from our portfolio, since in both (up or down) sce- 
narios the portfolio has the same value V\. But since this 



portfolio is riskless, its rate of return must be equal to the 
risk-free interest rate r, otherwise there would be an arbi- 
trage opportunity, as the following argument shows. 

Let r' denote the portfolio rate of return, i.e., r' is the 
solution to the following equation 

(l + r')V = V 1 . (7) 

If r' < r, then an arbitrageur should take a long position 
on (i.e., buy) the option and a short position on A stocks. 
To see why this is an arbitrage, let us go through the argu- 
ment in detail. At time t = the arbitrageur's net cash- 
flow would be B = \Vq\ = A • So — Co, which he should 
put in the bank so that in the next period he would have 
Bi = (1 + r)|Vo|. At time t = 1, he should then close his 
short position on A stocks, either exercising his option (up 
scenario) or buying A stocks directly in the market (down 
scenario). In either case, he would have to pay the same 
amount |Vi| = (1 + r')|Vo| < £?i, and hence would be left 
with a profit of B\ — \ V\\. On the other hand, if r' > r the 
arbitrageur should adopt the opposite strategy: go short on 
(i.e., underwrite) the option and long on A stocks (borrow- 
ing money from the bank to do so). 

We have thus shown that to avoid arbitrage we must have 
r' = r. This is indeed a very general principle that deserves 
to be stated in full: in a market free of arbitrage any risk- 
less portfolio must yield the risk-free interest rate r. This 
no-arbitrage principle is at the core of the modern theory of 
pricing derivatives, and, as such, it will be used several times 
in these notes. 

Let us now return to our option pricing problem. Setting 
r' = r in (7) and substituting the values of Vq and V\ given 
in Fig. 6, we obtain 

(l + r)[Co-AS* ]--A5 1 d . (8) 

Inserting the values of r = 0.006, So = 57, Sf = 53, and 
A = 2/3 into the equation above, it then follows that the 
option price that rules out arbitrage is 

C = 2.88. (9) 

It is instructive to derive the option price through a 
second method, namely, the martingale approach or risk- 
neutral valuation. To this end, we first note that from Fig. 5 
we see that the expected value of the option at expiration is 
E[Ci] = 7j8+|0 = 4. One could then think, not to- 
tally unreasonably, that the correct option price should be 
the expected payoff discounted to the present time with the 
risk-free interest rate. In this case one would get 

C = ^ = ^=3.98, 
1 + r 1.006 

which is quite different from the price found in (9). The 
faulty point of the argument above is that, while we used the 
risk- free rate r to discount the expected payoff £?[Ci], we 
have implicitly used the stock mean rate of return fi when 
calculating E[Ci\. Using these two different rates leads to 
a wrong price, which would in turn give rise to an arbitrage 
opportunity. 
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A way to avoid this arbitrage is to find fictitious proba- 
bilities q u and q^, with q u + qd = 1, such that the stock ex- 
pected return calculated with these new probabilities would 
equal the risk-free rate r. That is, we must demand that 

S (l + r) = = q u ■ S? + q d ■ S«, (10) 

where E® [x] denotes expected value with respect to the new 
probabilities q u and qj. Using the values for S" and Sf 
given in Fig. 4, we easily find that 

q u = 0.3618, q d = 0.6382. 

Under these probabilities, the expected value E®[Ci] of the 
option at maturity becomes E^[d] = 0.3618 x 8+0.6382 x 
= 2.894, which discounted to the present time yields 



C = 



1 + r 



2.894 
1.006 



21 



thus recovering the same price found with the delta-hedging 
argument. 

Note that under the fictitious probability q u and qd, all 
financial assets (bank account, stock, and option) in our bi- 
nomial model yield exactly the same riskless rate r. Proba- 
bilities that have this property of 'transforming' risky assets 
into seemingly risk-free ones are called an equivalent mar- 
tingale measure. Martingale measures is a topic of great 
relevance in Finance, as will be discussed in more detail in 
Sec. IV. 

In closing this subsection, it should be noted that the 
one-step binomial model considered above can be easily 
generalized to a binomial tree with, say, N time steps. But 
for want of space this will not be done here. (I anticipare 
here, however, that Black-Scholes model to be considered 
later corresponds precisely to the continuous-time limit of 
the binomial multistep model.) It is perhaps also worth men- 
tioning that binomial models are often used in practice to 
price exotic derivatives, for which no closed formula exists, 
since such models are rather easy to implement on the com- 
puter; see, e.g., [1] for more details on binomial models. 

2.6 Put-Call parity 

In the previous subsection I only considered the price of a 
(European) call option, and the attentive reader might have 
wondered how can one determine the price of the corre- 
sponding put option. It turns out that there is a simple re- 
lationship between European put and call options, so that 
from the price of one of them we can obtain the price of the 
other. To see this, form the following portfolio: i) buy one 
stock S and one put option P on this stock with strike price 
K and maturity T, and ii) short one call option C with the 
same strike and maturity as the put option. The value of such 
portfolio would thus be 



V = S + P - C. 



(11) 



Now from (5) and (6), one immediately sees that at expira- 
tion we have P—C = K — S,so that the value of the above 
portfolio at time T becomes simply 



Since this portfolio has a known (i.e., riskless) value at time 
t = T, it then follows from the no-arbitrage condition that 
its value at any time < t < T must be given by 



V = Ke- r ^ T - t \ 



(13) 



where r is the risk-free interest rate. Inserting (13) into (1 1) 
immediately yields the so-called put-call parity relation: 



P = C - S + Ke- r{T - t] . 



(14) 



3 Brownian motion and stochastic 
calculus 

3.1 One-dimensional random walk 

Every physics graduate student is familiar, in one way or an- 
other, with the concept of a Brownian motion. The custom- 
ary introduction [15] to this subject is through the notion of 
a random walk, in which the anecdotal drunk walks along a 
line taking at every time interval At one step of size I, either 
to the right or to the left with equal probability. The posi- 
tion, X(t), of the walker after a time t = NAt, where N 
is the number of steps taken, represents a stochastic process. 
(See Appendix A for a formal definition of random variables 
and stochastic processes.) As is well known, the probability 
P(X(t) — x) for the walker to be found at a given position 
x = nl, where n is an integer, at given time t, is described 
by a binomial distribution [15]. 

Simply stated, the Brownian motion is the stochastic 
process that results by taking the random walk to the con- 
tinuous limit: At — > 0, I — > 0, N — > oo, n — > oo such that 
t = NAt and x — nl remain finite. (A more formal defini- 
tion is given below.) Here, however, some caution with the 
limits must be taken to ensure that a finite probability den- 
sity p(x, t) is obtained: one must take At — > and I — > 0, 
such that I 2 = a At, where a is a constant. In this case one 
obtains that p(x, t) is given by a Gaussian distribution [15]: 



p(x,t) 



1 



exp 



— )• 

2cr 2 t J 



(15) 



At this point let us establish some notation. Let X be 
a random variable with probability density function (pdf) 
given by p(x). [Following standard practice, we shall de- 
note a random variable by capital letters, while the values it 
takes will be written in small letters]. The operator for ex- 
pectation value will be denoted either as E[-] or < • >, that 
is, 



E[f(X)} = (f(X)) 



F 

J — c 



f(x)p(x)dx, (16) 



V T = K. 



(12) 



where f(x) is an arbitrary function. Although the angular- 
bracket notation for expectation value is preferred by physi- 
cists, we shall often use the E notation which is more con- 
venient for our purposes. 
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A Gaussian or normal distribution with mean m and 
standard deviation a will be denoted by J\f(m, a), whose 
pdf is 

1 ( (x-m) 2 ) 

p*(*,*) = ^exp(-4-L}. (17) 

Let us also recall that the (nonzero) moments of the Gaus- 
sian distribution are as follows 

E[X] = m, E[X 2 } = a 2 , (18) 
E[X 2n ] = l-3-5-...-(2n-l)cr 2 ". (19) 

3.2 Brownian motion and white noise 

We have seen above that a ID Brownian motion can be 
thought of as the limit of a random walk after infinitely 
many infinitesimal steps. This formulation was first given 
in 1900 by Bachelier [14] who also showed the connection 
between Brownian motion and the diffusion equation (five 
years before Einstein's famous work on the subject [16]). It 
is thus telling that the first theory of Brownian motion was 
developed to model financial asset prices! A rigorous math- 
ematical theory for the Brownian motion was constructed 
by Wiener [17] in 1923, after which the Brownian motion 
became also known as the Wiener process. 

Definition 2 The standard Brownian motion or Wiener pro- 
cess {W(t),t > 0} is a stochastic process with the follow- 
ing properties: 

1. W(0) = 0. 

2. The increments W(t) — W(s) are stationary and in- 
dependent. 

3. For t > s, W(t) — W(s) has a normal distribution 
Af{0, y/t=~S). 

4. The trajectories are continuous (i.e., "no jumps"). 

The stationarity condition implies that the pdf of W(t) — 
W(s), for t > s, depends only on the time difference t — s. 

I 



(For a more precise definition of stationary processes see 
Appendix A.) Now, it is not hard to convince oneself that 
conditions 2 and 3 imply that W(t) is distributed according 
to 7V(0, Vt) for t > 0. In particular, we have E[W(t)} = 
for all t > 0. Furthermore, one can easily show that the 
covariance of the Brownian motion is given by 

E[W(t)W(s)] = s, for t > s. 

It is also clear from the definition above that the Brownian 
motion is a Gaussian process (see Appendix A for the for- 
mal definition of Gaussian processes). Then, since a Gaus- 
sian process is fully characterized by its mean and covari- 
ance, we can give the following alternative definition of the 
Brownian motion. 

Definition 3 The standard Brownian motion or Wiener pro- 
cess {W(t), t > 0} is a Gaussian process with E[W(t)] = 
QandE[W{t)W{s)]=mm(s,t). 

The Brownian motion has the important property of hav- 
ing bounded quadratic variation. To see what this means, 
consider a partition {^}™ =0 °f tne interval [0,t], where 
= t < t\ < . . . < t n = t. For simplicity, let us take 

equally spaced time intervals: — ti-\ = At = —. The 

n 

quadratic variation of W(t) on [0, t] is defined as 

n 
i=0 

where AWi = W(U) - W(U-i). Since AWi is distributed 
according to Af(0, y/At) we have that E[AW 2 ] = At, 
which implies that 

E[Q n ] - t. (21) 

Furthermore, using the fact that the increments AWi are in- 
dependent and recalling that the variance of the sum of in- 
dependent variables is the sum of the variances, we get for 
the variance of Q n : 



n n 

var[Q n ] = J2™l AW fi = T,{ E l AW ^-( E l AW ^) 2 } 

i=0 i=0 

= ±[mt) 2 -(At) 2 ] = 2 ^, 



where in the third equality we used (19) and the fact that 
AWi has distribution Af(0, \f~A~t). We thus see that 



On the other hand, we have that 



var[Q n ]=E (Q n - E[Q n ]) 2 



= E 



(Qn-tr 



, (23) 



var[Q n ] — > 0, as n — > oo. 



(22) 



where in the last equality we have used (21). Comparing 



1046 



Giovani L. Vasconcelos 



(22) and (23) then yields 



lim E 



0. 



We have thus proven that Q n converges to t in the mean 
square sense. This fact suggests that AW 2 can be thought 
of as being of the order of At, meaning that as At — > the 
quantity AW 2 "resembles more and more" the deterministic 
quantity At. In terms of differentials, we write 

[dW] 2 = dt. (24) 

Alternatively, we could say that dW is of order \[dt: 

dW = 0(Vdt). (25) 

(I remark parenthetically that the boundedness of the 
quadratic variation of the Brownian motion should be 
contrasted with the fact that its total variation, A n = 
Ym=o |AWj|, is unbounded, that is, A n — > oo as n — > oo, 
with probability 1; see [7].) 

Another important property of the Brownian motion 
W(t) is the fact that it is self-similar (or more exactly self- 
affine) in the following sense: 



W(t) 20 



W(t) 50 



W(at) = a 1/2 W(t), 



(26) 




for all a > 0. Here = means equality in the sense of prob- 
ability distribution, that is, the two processes W(at) and 
a}/ 2 W(t) have exactly the same finite-dimensional distribu- 
tions p(x\,t\\ x n , t n ) for any choice of t i7 i = 1, n, 
and n > 1. Self-similarity means that any finite portion of 
a Brownian motion path when properly rescaled is (statisti- 
cally) indistinguishable from the whole path. For example, 
if we 'zoom in' in any given region (no matter how small) 
of a Brownian motion path, by rescaling the time axis by a 
factor of a and the vertical axis by a factor of y/a, we obtain 
a curve similar (statistically speaking) to the original path. 
An example of this is shown in Fig. 7. In the language of 
fractals, we say that a trajectory of a Brownian motion is a 
fractal curve with fractal dimension D = 2. 

The self-similarity property implies that sample paths of 
a Brownian motion are nowhere differentiable (technically, 
with probability 1). A formal proof of this fact, although 
not difficult, is beyond the scope of the present notes, so that 
here we shall content ourselves with the following heuris- 
tic argument. Suppose we try to compute the derivative of 
W(t) in the usual sense, that is, 



dW ,. AW 
—— = lim — — 
dt At^o At 



lim 
At^o 



W{t + At) - W{t) 
At ' 



But since AW is of order v At, it then follows that 



AW 



= O 



1 



At VVAt 
so that dW/dt = oo as At -> 0. 



(27) 



Figure 7. Self-similarity of a Brownian motion path. In (a) we plot 
a path of a Brownian motion with 15000 time steps. The curve in 
(b) is a blow-up of the region delimited by a rectangle in (a), where 
we have rescaled the x axis by a factor 4 and the y axis by a factor 
2. Note that the graphs in (a) and (b) "look the same," statistically 
speaking. This process can be repeated indefinitely. 

Although the derivative of W(t) does not exist as a reg- 
ular stochastic process, it is possible to give a mathematical 
meaning to dW/dt as a generalized process (in the sense 
of generalized functions or distributions). In this case, the 
derivative of the W(t) is called the white noise process 



dW 



(28) 



I shall, of course, not attempt to give a rigorous definition 
of the white noise, and so the following intuitive argument 

will suffice. Since according to (27) the derivative — — di- 

dt 

verges as —=, a simple power-counting argument suggests 
Vdt 

that integrals of the form 

I(t)= f g(t')Z(t')dt', (29) 
Jo 

should converge (in some sense); see below. 

In physics, the white noise £(t) is simply 'defined' as 
a 'rapidly fluctuating function' [15] (in fact, a generalized 
stochastic process) that satisfies the following conditions 



<£(*)> = o, 

mm) - <*(*• 



t '). 



(30) 
(31) 
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These two relations give the 'operational rules' from which 
quantities such as the mean and the variance of the integral 
I(t) in (29) can be calculated. It is convenient, however, to 
have an alternative definition of stochastic integrals in terms 
of regular stochastic process. Such a construction was first 
given by the Japanese mathematician Ito [18]. 



3.3 Ito stochastic integrals 

Using (28), let us first rewrite integral (29) as an "integral 
over the Wiener process" W(t): 

I(t) = / g(t')dW(t'). (32) 
Jo 

The idea then is to define this integral as a kind of Riemann- 
Stieltjes integral. We thus take a partition {ti}f =0 of the 
interval [0, t] and consider the partial sums 



J 



(33) 



i=i 



r 



The function g(t) above must satisfy certain appropriate 
conditions [7], the most important one being that g(t) be 
a non-anticipating function. This means, in particular, that 
the value ) in (33) is independent of the 'next incre- 

ment' AW(ti) of the Brownian motion. [For this reason, 
choosing to evaluate g(t) at the beginning of the interval 
Ati =ti — ti-i is a crucial point in the definition of the Ito 
stochastic integral. Another possible choice is to evaluate 
g(t) at the mid point t* = (tj_i + U)/2, which leads to the 
Stratonovich integral [8]. In these notes I shall only consider 
Ito integrals.] 

Under the appropriate conditions on g(t), it is then pos- 
sible to show that the partial sums I n converge in the mean 
square sense. That is, there exists a process I(t) such that 



e (i n -i(t)Y 



as 



oo. 



(34) 



Using the fact that g(t) is non-anticipating and that 
E [AW{t)\ = 0, it follows immediately from the definition 
(33) that I(t) has zero mean: 



E[I(t)} = E 



f g{t')dW{£) 
Jo 



= 0, 



(35) 



It is also possible to show that stochastic integrals obey the 
so-called isometry property: 

21 



E 



E 



g(t')dW(t') 



fE[g\t>) 
Jo 



dt'. 



(36) 



We now see that the true meaning of conditions (30) and 
(31) is given by properties (35) and (36), for the particular 
case when g(t) is a deterministic function. 

The Ito integral does not conform to the usual integra- 
tion rules from deterministic calculus. An example is the 
formula below 



/' 

Jo 



WdW 



1 



W(tf 



1 



which is left as an exercise for the reader [19]. Ito integrals 
offer however a convenient way to define (and deal with) 
stochastic differential equations, as we will see next. 



3.4 Stochastic differential equations 

Physicists are quite familiar with differential equations in- 
volving stochastic terms, such as the Langevin equation 



dv 
~dt 



(37) 



which describes the motion of a Brownian particle in a vis- 
cous liquid [15]. Here 7 is the viscosity of the fluid and a is 
the 'amplitude' of the fluctuating force acting on the Brown- 
ian particle. (These parameters are usually considered to be 
constant but in general they could be non-anticipating func- 
tions of time.) Equation (37) does not however make much 
mathematical sense, since it evolves a quantity, namely, the 
derivative £(t) of the Brownian motion, that does not even 
exist (except as a generalized process). Nevertheless, it is 
possible to put this equation on a firm mathematical basis 
by expressing it as a stochastic integral equation. First we 
rewrite (37) as 



dv = —fvdt + adW, 
which upon integration yields 



(38) 



v(t) = v(0)- f jv(t')dt'+ f adW{t'). (39) 
Jo Jo 

This integral equation now makes perfectly good sense — in 
fact, its solution can be found explicitly [19]. 

Let us now consider more general stochastic differential 
equations (SDE) of the form 



dX = a(X, t)dt + b{X,t)dW, 



(40) 
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where a(x,t) and B(x,t) are known functions. Note that 
this 'differential equation' is actually a short-hand notation 
for the following stochastic integral equation 

X(t)=X(0)+[ a(X,t , )dt , + f b(X,t')dW(tf). (41) 
Jo Jo 

Under certain condition on the functions a(x, t) and b(x, t), 
it is possible to show (see, e.g., [8]) that the SDE (40) has a 
unique solution X(t). 

Let us discuss another simple SDE, namely, the Brown- 
ian motion with drift: 



dX = fidt + crdW, 



(42) 



where the constant /j, represents the mean drift velocity. In- 
tegrating (42) immediately yields the process 



X{t)=fti + W{t), 



whose pdf is 

p{x,t) 



2na 2 t 



exp 



(a; - fxtf 
2a 2 t 



(43) 



(44) 



Another important example of a (linear) SDE that can be 
solved explicitly is the geometric Brownian motion that will 
be discussed shortly. But before doing that, let us discuss a 
rather useful result known as ltd lemma or ltd formula. 



where we used the fact that dW 2 = dt and dtdW = 
0(dt 3 / 2 ). (Here we have momentarily omitted the argu- 
ments of the functions a and b for ease of notation.) In- 
serting (47) into (46) and retaining only terms up to order 
dt, we obtain 



dF = 



dF l l2 d 2 F 

h -b 

dt 2 dx 2 



dt 



dF 
b—dX, 
ox 



(48) 



which is known as Ito formula. Upon using (40) in the equa- 
tion above, we obtain Ito formula in a more explicit fashion 



dF 
+ 



dF 
~dt 



a(X,t) 



dF 

dx 



1 d 2 F~l 



dt 



dF 

b{X,t)—dW, 



(49) 



What is noteworthy about this formula is the fact that the 
fluctuating part of the primary process X(t) contributes to 
the drift of the derived process Z(t) = F(t, X) through the 
term \b 2 (t, X)^^-. We shall next use Ito formula to solve 
explicitly a certain class of linear SDE's. 

3.6 Geometric Brownian motion 

A stochastic process of great importance in Finance is the 
so-called geometric Brownian notion, which is defined as 
the solution to the following SDE 



dS = fiSdt + crSdW, 



(50) 



3.5 Ito formula 

Consider the generic process X(t) described by the SDE 
(40), and suppose that we have a new stochastic process Z 
defined by 

Z(t) = F(X(t),t), (45) 

for some given function F(x, t). We now wish to find the lo- 
cal dynamics followed by the Z(t), that is, the SDE whose 
solutions corresponds to the process Z{t) above. The an- 
swer is given by the Ito formula that we now proceed to 
derive. 

First, consider the Taylor expansion of the function 

F(X,t): 



where /i and a are constants, subjected to a generic initial 
condition S(t ) — So- Let us now perform the following 
change of variables Z = In S. Applying Ito formula (49) 
with a = fiS,b = aS and F(S) = In S, it then follows that 



dZ = \n - l -a 2 ) <// 4 ad\V. 



(51) 



which upon integration yields 

Z(t) = Z +(V- \^ (t-t )+<j[W(t)-W(t )}, (52) 

where Z a — In S a . Reverting to the variable S we obtain the 
explicit solution of the SDE (50): 



{dXf 



dF , dF 1V ld 2 F 

dF = m dt+ dx- dx + 2d^ 

1 d 2 F , , N2 1 d 2 F , IV 
+ ^^r{dt) 2 + -^n^dtdX + .. 



2 dt 2 



2dtdx 



Note, however, that 



(46) 



(dX) 2 = b 2 dW 2 + 2abdtdW + a 2 (dt) 2 

= b 2 dt + 0(dt^ 2 ), (47) 



S(t) = So cxp K 2 ^ (t - to) + a[W{t) - W(t )} | 

(53) 

From (52) we immediately see that Z(t) — Zo is dis- 
tributed according to J\f ((/i — |cr 2 ) r, ct^/t), where r = 
t — to- It then follows that the geometric Brownian motion 
with initial value S(t ) = So has the following log-normal 
distribution: 
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p{S,t;S ,t ) 



1 



V2<t 2 tS 



exp < 



M&)-(a*-i°v 



2ct 2 i 



(54) 



The geometric Brownian motion is the basic model for stock 
price dynamics in the Black-Scholes framework, to which 
we now turn. 

4 The Standard Model of Finance 
4.1 Portfolio dynamics and arbitrage 

Consider a financial market with only two assets: a risk- 
free bank account B and a stock S. In vector notation, 
we write S(t) — (B(t), S(t)) for the asset price vector at 
time t. A portfolio in this market consists of having an 
amount x a in the bank and owing X\ stocks. The vector 
x(t) = (x (t),xi(t)) thus describes the time evolution of 
your portfolio in the (B, S) space. Note that Xi < means 
a short position on the ith asset, i.e., you 'owe the market' 
\xi \ units of the ith asset. Let us denote by Vg(t) the money 
value of the portfolio x(t): 

V s = x-S = x Q B + x 1 S, (55) 

where the time dependence has been omitted for clarity. We 
shall also often suppress the subscript from Vg(t) when there 
is no risk of confusion about to which portfolio we are re- 
ferring. 

A portfolio is called self-financing if no money is taken 
from it for 'consumption' and no additional money is in- 
vested in it, so that any change in the portfolio value comes 
solely from changes in the asset prices. More precisely, a 
portfolio x is self-financing if its dynamics is given by 

dV s (t) = x (t) ■ dS(t), t>0. (56) 

The reason for this definition is that in the discrete-time 
case, i.e., t = t n , n = 0,1,2,..., the increase in wealth, 
AV(t n ) = V(t n+ i) — V{t n ), of a self-financing portfolio 
over the time interval t„ +1 — t n is given by 

AV(t„) = x(t n ) ■ AS(t n ), (57) 

where AS(t n ) = S(t n+ i) — S(t n ). This means that over 
the time interval t n+1 — t n the value of the portfolio varies 
only owing to the changes in the asset prices themselves, and 
then at time t n +i re-allocate the assets within the portfolio 
for the next time period. Equation (56) generalizes this idea 
for the continuous -time limit. If furthermore we decide on 
the make up of the portfolio by looking only at the current 
prices and not on past times, i.e., if 

x(t) =x(t,S(t)), 



then the portfolio is said to be Markovian. Here we shall 
deal exclusively with Markovian portfolios. 

As we have seen already in Sec. 2.4, an arbitrage rep- 
resents the possibility of making a riskless profit with no 
initial commitment of money. A more formal definition of 
arbitrage is as follows. 

Definition 4 An arbitrage is a portfolio whose value V(t) 
obeys the following conditions 

0) V(0) = 

(ii) V(t) > with probability 1 for all t > 
(Hi) V(T) > with positive probability for some T > 0. 



The meaning of the first condition is self-evident. The 
second condition says that there is no chance of losing 
money, while the third one states that there is a possibility 
that the portfolio will acquire a positive value at some time 
T. Thus, if you hold this portfolio until this arbitrage time 
there is a real chance that you will make a riskless profit 
out of nothing. [If P(V(T) > 0) = 1 we have a strong 
arbitrage opportunity, in which case we are sure to make a 
profit.] As we have already discussed in Sec. 2.4, arbitrage 
opportunities are very rare and can last only for a very short 
time (typically, of the order of seconds or a few minutes at 
most). In fact, in the famous Black-Scholes model that we 
will now discuss it is assumed that there is no arbitrage at 
all. 

4.2 The Black-Scholes model for option pric- 
ing 

The two main assumptions of the Black-Scholes model are: 

(i) There are two assets in the market, a bank account B 
and a stock £, whose price dynamics are governed by 
the following differential equations 

dB = rBdt, (58) 
dS = nSdt + aSdW, (59) 

where r is the risk-free interest rate, /i > is the stock 
mean rate of return, a > is the volatility, and W(t) 
is the standard Brownian motion or Wiener process. 

(ii) The market is free of arbitrage. 
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Besides these two crucial hypothesis, there are addi- 
tional simplifying (technical) assumptions, such as: (iii) 
there is a liquid market for the underlying asset S as well 
as for the derivative one wishes to price, (iv) there are no 
transaction costs (i.e., no bid-ask spread), and (v) unlimited 
short selling is allowed for an unlimited period of time. It 
is implied by (58) that there is no interest-rate spread either, 
that is, money is borrowed and lent at the same rate r. Equa- 
tion (59) also implies that the stock pays no dividend. [This 
last assumption can be relaxed to allow for dividend pay- 
ments at a known (i.e., deterministic) rate; see, e.g., [4] for 
details.] 

We shall next describe how derivatives can be 'ratio- 
nally' priced in the Black-Scholes model. We consider first 
a European call option for which a closed formula can be 
found. (More general European contingent claims will be 
briefly considered at the end of the Section.) Let us then de- 
note by C(S, t; K, T) the present value of a European call 
option with strike price K and expiration date T on the un- 
derlying stock S. For ease of notation we shall drop the 
parameters K and T and simply write C(S,t). For later 
use, we note here that according to Ito formula (49), with 
a = fiS and b = crS, the option price C obeys the following 
dynamics 



Inserting this back into (62), we then find 



dC 



BC BC 1 



2 S 2 



b 2 c 

dS 2 



BC 

dt + aS—dW. 

(60) 

In what follows, we will arrive at a partial differential 
equation, the so-called Black-Scholes equation (BSE), for 
the option price C(S, t). For pedagogical reasons, we will 
present two alternative derivations of the BSE using two dis- 
tinct but related arguments: i) the A-hedging portfolio and 
ii) the replicating portfolio. 

4.2.1 The delta-hedging portfolio 

As in the binomial model of Sec. 2.5, we consider the self- 
financing A-hedging portfolio, consisting of a long position 
on the option and a short position on A stocks. The value 
II(£) of this portfolio is 

n(t) = c(s,t) - a s. 

Since the portfolio is self-financing, it follows from (56) that 
II obeys the following dynamics 



dU = dC - A dS, 
which in view of (59) and (60) becomes 



dC BC 1 2ci2 B 2 C 



( ^ a ,nv. 



(61) 

dt 
(62) 



We can now eliminate the risk [i.e., the stochastic term 
containing dW] from this portfolio by choosing 



A = 



BC 
dS' 



(63) 



cm 



BC 1 2 2 8 2 C 
~BJ + 2 <T S 3^ 



dt. 



(64) 



Since we now have a risk-free (i.e., purely deterministic) 
portfolio, it must yield the same rate of return as the bank 
account, which means that 



dll = rlldt. 



(65) 



Comparing (64) with (65) and using (61) and (63), we then 
obtain the Black-Scholes equation: 



BC 1 2ci2 B 2 C BC _ n 
1H + 2° S BS^ + rS BS- rC = °> 



(66) 



which must be solved subjected to the following boundary 
condition 



C(S, T) = max(S — K,0). 



(67) 



The solution to the above boundary-value problem can be 
found explicitly (see below), but before going into that it is 
instructive to consider an alternative derivation of the BSE. 
[Note that the above derivation of the BSE remains valid 
also in the case that r, fj,, and, a are deterministic functions 
of time, although a solution in closed form is no longer pos- 
sible.] 

4.2.2 The replicating portfolio 

Here we will show that it is possible to form a portfolio on 
the (B, S) market that replicates the option C(S, t), and in 
the process of doing so we will arrive again at the BSE. 
Suppose then that there is indeed a self-financing portfo- 
lio x(t) = (x(t),y(t)), whose value Z(t) equals the option 
price C(S, t) for all time t < T: 



Z = xB + yS = C, 



(68) 



where we have omitted the time-dependence for brevity. 
Since the portfolio is self-financing it follows that 

dZ = xdB + ydS = (rxB + (iyS)dt + aySdW. (69) 

But by assumption we have Z = C and so dZ = dC. 
Comparing (69) with (60) and equating the coefficients sep- 
arately in both dW and dt, we obtain 



y = 

BC 
~3t 



BC 
dS' 



rxB + —a 



2 S 2 



3 2 C 
BS 2 



0. 



Now from (68) and (70) we get that 



1 

B 



C-S 



BC' 

~BS 



(70) 
(71) 

(72) 
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which inserted into (71) yields again the BSE (66), as the 
reader can easily verify. 

We have thus proven, by direct construction, that the op- 
tion C can be replicated in the (B, S) -market by the port- 
folio (x, y), where x and y are given in (72) and (70), re- 
spectively, with option price C being the solution of the 
BSE (with the corresponding boundary condition). [To com- 
plete the proof, we must also show that the initial price 
Co = C(S, 0) is the 'correct' one, in the sense that if the op- 
tion price were C ^ Co, then there would be an arbitrage 
opportunity. In fact, if C' Q > Co an arbitrageur should short 
the option and invest in the replicating portfolio, whereas if 
C' < Co he should do the opposite.] 

4.3 The Black-Scholes formula 

Here we will solve equation (66) subjected to the boundary 
condition (67). Following the original work of Black and 
Scholes [12], the idea is to perform a change of variables so 
as to turn the BSE into the heat equation, which we know 
how to solve. Here we will not use the original transfor- 
mation employed by these authors but a related one [6], as 
shown below: 



T-t 



T = 



x = In 



S 



2/cr 2 ' ~ ~"V^ 

_ otx-\-(3 T 

C(S,t) 



u(x, t) = e 



K 



where 



Q 



2r 



1 



2r 



1 



(73) 
(74) 

(75) 



After a somewhat tedious but straightforward algebra 
[6], one obtains that in the new variables equation (66) reads 



du d 2 u 
dr dx 2 ' 



(76) 



while the terminal condition (67) becomes an initial condi- 
tion 



u(x, 0) = uq(x) 



°.0) 



(77) 



We now recall that the Green's function for the heat 
equation is 



G{x,x') = 



-(x-x') 2 /4t 



so that its generic solution for an arbitrary initial condition 
Uo(x) is given by 

/oo 
uo(s')G(s, x')dx' 
-oo 

1 f°° 

= -L= / uoixy-^'^/^dx'.os) 

V4.7TT J_oo 

Inserting (77) into the integral above we obtain 



u(t, x) 



X= r - e aX ') e-^- x '^/ 4T dx' 

4.7TT Jo ^ ' 



1(0) -1(a), 



where 



1(a) 



<*-*'?l^dx' 



(79) 



(80) 



After completing the squares and performing some simplifi- 
cation, we find that 



where 



1(a) = e 



d„ 



ax+a r 



N(d a ), 



x + 2ar 



(81) 



(82) 



and N(x) denotes the cumulative distribution function for a 
normal variable A/"(0, 1): 



N(x) = 



1 



2 / 2 ds. 



(83) 



Inserting (81) into (79) and reverting back to the original 
dimensional variables, we obtain the famous Black-Scholes 
formula for the price of a European call option: 



where 



C(S,t) = SN(dx) - Ke- r{T - t ^N(d 2 ) 1 
ln(f) + (r+ia 2 )(T-i) 



di 



d 2 



Mf) + (r-k 2 )Cr-t) 

ay/T-t 



(84) 

(85) 
(86) 
(87) 



This formula is so often used in practice that it is already 
pre-defined in many software packages (e.g., Excel, Mat- 
lab, Maple, etc) as well as in most modern hand calculators 
with financial functions. It should noted, however, that many 
people (academics and practitioners alike) believe that the 
Black-Scholes model is too idealized to describe real mar- 
ket situations; see Sees. V and VII for a brief discussion of 
possible extensions of the BS model. 

4.4 Completeness in the Black-Scholes model 

We have seen above that it is possible to replicate a Euro- 
pean call option C(S, t) using an appropriate self-financing 
portfolio in the (B, S) market. Looking back at the ar- 
gument given in Sec. 4.2.2, we see that we never actually 
made use of the fact that the derivative in question was a call 
option — the nature of the derivative appeared only through 
the boundary condition (67). Thus, the derivation of the BSE 
presented there must hold for any contingent claim! 

To state this fact more precisely, let i^S*, t) represent the 
price of an arbitrary European contingent claim with payoff 
F(S,T) = &(S), where $ is a known function. Retracing 
the steps outlined in Sec. 4.2.2, we immediately conclude 
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that the price F(S, t) will be the solution to the following 
boundary- value problem 



Ik 



1 2 q2 d 2 F 
2 aS dS^ 



dF 

rS— -rF = , (88) 
F(S,T) = <f>(S) . (89) 



Furthermore, if we repeat the arguments of preceding sub- 
section and transform the Black-Scholes equation (88) into 

I 



F(S,t) 



-r(T-i) 



the heat equation, we obtain that F(S, t) will be given by 

F(S,t)=-= ^(x'y-^-^/^dx', (90) 

VA-KT J-oo 

where $(x) denotes the payoff function in terms of the di- 
mensionless variable x; see (73). Expressing this result in 
terms of the original variables S and t yields a generalized 
Black-Scholes formula 



^27TCr 2 (T-t) 



Jo & 



(91) 



In summary, we have shown above that the Black- 
Scholes model is complete. A market is said to be com- 
plete if every contingent claim can be replicated with a 
self-financing portfolio on the primary assets. Our 'proof 
of completeness' given above is, of course, valid only for 
the case of European contingent claims with a simple pay- 
off function &(S); it does not cover, for instance, path- 
dependent derivatives. It is possible however to give a for- 
mal proof that arbitrage-free models, such as the Black- 
Scholes model, are indeed complete; see Sec. 5.3. 

Comparing the generalized Black-Scholes formula (91) 
with the pdf of the geometric Brownian motion given in 
(54), we see that the former can be written in a convenient 
way as 

F{S,t) = e- r < T -*)E? iS [*{ST)], (92) 

where Ef s [•] denotes expectation value with respect to the 
probability density of a geometric Brownian motion with 
H = r, initial time t, final time T, and initial value S; see 
(54). In other words, the present value of a contingent claim 
can be computed simply as its discounted expected value 
at maturity, under an appropriate probability measure. This 
idea will become more clear after we discuss the notion of 
an equivalent martingale measure. 

5 Efficient markets: the martingale 
approach 

5.1 Martingales 

The concept of a martingale plays a important role in finance 
[20]. Unfortunately, a proper introduction to martingales re- 
quires some knowledge of probability measure theory [21]. 
Here however we will give a rather intuitive discussion of 
martingales. For completeness we have listed in Appendix 
A some basic concepts from probability theory that would 
be required to make the following discussion more rigorous. 

We begin by recalling that a probability space is a triple 
(Q,T, P), where 



• Ct is the space of elementary events or outcomes lo. 

• T is a properly chosen family of subsets of ft, (i.e., a 
cr-algebra on ft). 

• P is a probability measure on T . 

In Finance, an outcome w is a 'market situation.' The 
family of subsets T specifies the class of events to which 
probabilities can be assigned. This is done through the 
concept of a cr-algebra, whose formal definition is given 
in Appendix A. A probability measure P on T is simply 
a function P : T — > [0, 1], satisfying a few 'obvious re- 
quirements': P(0) = 0, P(Q) = 1, and P{A 1 U A 2 ) = 
P(Ai) + P(A 2 ) if A x n A 2 = 0. An element A of T, 
A e T, is called a "measurable set" or "observable event," 
meaning that it is possible to assign a "probability of occur- 
rence," P(A) g [0,1], for this event. Hence, T is said to be 
the set of 'observable events.' 

Suppose now that we have a random function X : 
fl — > R. If to every subset of ft of the form {uj : a < 
X(w) < b} there corresponds an event A C T, then the 
function X is said to be measurable with respect to T or 
simply ^-measurable. What this means is that it is pos- 
sible to 'measure' (i.e., assign a probability to) events of 
the form {a < X < b} through the obvious definition: 
P({a < X < b}) = p(A). A J^-measurable function X 
is called a random variable. 

Let us next consider the notion of an "information flow." 
In a somewhat abstract way, we will represent the informa- 
tion available to an observer up to time t as a cr-algebra 
Tt C T . In the context of Finance, the information set 
T t would contain, for instance, the price history up to time 
t of all assets in the economy. It is natural to assume that 
T s C T t for s < t, since we expect that, as time goes on, 
we gain new information (and do not discard the old ones). 
Such a collection of cr-algebras represents an "information 
flow," or, more technically, a filtration. 

Definition 5 A filtration or information flow is a collection 
{fijoo of a -algebras Tt C T such that 

T s C T u for < s < t. 
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Suppose now we have a stochastic process X t defined on 
(CI, J- ', P). (Think of X t as being, say, the price of a given 
stock.) If the values of X t can be completely determined 
from the information T t , for all t > 0, then the process X t 
is said to be adapted to the filtration {T t }t>a- 

Definition 6 The process X t is adapted to the filtration 
{Ft}t>o if X t is T t -measurable for all t > 0. 

A stochastic process X t naturally generates an informa- 
tion flow, denoted by JF t x , which represents the "informa- 
tion contained in the trajectories of X(t) up to time i." A 
process X(t) is obviously adapted to its natural filtration 

•>t ■ 

A last piece of mathematics is necessary to define a mar- 
tingale, namely, the notion of conditional expectation. This 
appears in connection with the crucial question of how the 
information, T ta , available at present time influences our 
knowledge about future values of X(t). If X(t) and F to 
are not independent, then it is reasonable to expect that the 
information available up to the present time reduces the un- 
certainty about the future values of X(t). To reflect this 
gain of information is useful to introduce a new stochastic 
process 

Z(t) = E[X t \F t0 ], t>t , 

where the symbol S[X t |jF to ] represents "the expected value 
of X t , contingent on the information gathered up to time to?' 

A precise definition of conditional expectation is beyond 
the scope of these notes. Here it will suffice to say that given 
a random variable Y on a probability space (CI, T , P) and 
another cr-algebra T' C T, it is possible to define a ran- 
dom variable Z = E[Y\ T'\, which represents "the expected 
value of Y, given the information contained in J 7 '." The 
variable Z is a coarser version of the original variable Y, in 
the sense that we have used the information on T' to reduce 
the uncertainty about Y. The following two properties of 
conditional expectations will be necessary later: 

E[Y\T t ] = y, if y is ^-measurable. (93) 
E[E[Y\T t ]] = E[Y). (94) 

The first property above is somewhat obvious: if Y is Tt- 
measurable then Y and T t 'contain the same information,' 
hence taking expectation of Y conditional to Tt does not re- 
duce the uncertainty about Y . The second property is the 
so-called law of iterated expectations, and basically repre- 
sents the law of total probability. 

After these mathematical preliminaries, we are now in a 
position to define martingales. 

Definition 7 A stochastic process M t is called a martingale 
with respect to the filtration {T t }t>o if 

(i) M t is adapted to the filtration {T t }t>o 

(ii) E[\M t \] < oo for all t> 

(Hi) E[M t \T t0 ] = M to forall t > t 



Condition (i) simply says that M t can be determined 
from the information available up to time t, whereas con- 
dition (ii) is a technicality. The defining property of a mar- 
tingale is therefore condition (iii), which is usually referred 
to as the martingale condition. It says that the best predic- 
tion of future values of the process M(t), contingent on the 
information available at the present time, is the current value 
M t0 . 

Because of property (iii), a martingale is usually de- 
scribed as a "fair game." To see why this is so, suppose that 
M n represents the fortune of a gambler at time n. (For con- 
venience let us assume here that time is discrete.) The differ- 
ence h n — M n — M„_ i is then the amount the gambler wins 
on the nth play (a negative win is of course a loss). Now let 
us compute the gambler's expected gain on the (n + l)th 
play, given the information up to time n: 

E[h n+1 \T n ] = E[M n+1 - M n \T n ] 

= E[M n+1 \T n ] - E[M n \T n ] 

= M n -M n 

= 0, (95) 

where in the third equality we used the martingale property 
and rule (93). We thus have that at each new play of the 
game the expected gain is null, and in this sense it is a "fair" 
game. 

A Brownian motion W(t) is a martingale with respect 
to its natural filtration. To show this, we only need to verify 
the martingale condition (since the other two conditions are 
trivially fulfilled): 

E[W(t)\T tQ ] = E[W(t)-W(t ) + W(t )\T to ] 

= E[W(t)-W(t )\F tQ \ + E[W(t )\F t0 ] 
= + W(t ) 

= W(t ). (96) 

In the third equality above we used the fact that the incre- 
ments W(t) — W(to) are independent of Tt a and have zero 
mean, together with property (93). It is also possible to 
show that Ito stochastic integrals are martingales. Indeed, 
the theory of stochastic integration is intimately connected 
with martingale theory [5]. 

Another important property of martingales is that their 
expected value remains constant in time: 

E[M ] = E[E[M t \Fo]] = E[M t ], 

where in first equality we used the martingale property, 
while in the second equality property (94) was used. Thus, 
a necessary (but not sufficient) condition for a process to 
be a martingale is that it have no drift. Therefore, a diffu- 
sion process of the form (40) is not a martingales unless the 
drift term vanishes. For this reason, the geometric Brown- 
ian motion (50) is not a martingale. It is possible, however, 
to introduce a new probability measure Q, with respect to 
which the geometric Brownian motion becomes a standard 
Brownian motion and hence a martingale, as discussed next. 



1054 



Giovani L. Vasconcelos 



5.2 Equivalent martingale measures 

Recall that a probability measure P on a measurable space 
(CI, J 7 ) is a function P : T — > [0, 1] that assigns to every 
event A C T a real number P(A) € [0, 1]. Suppose now 
we have another probability measure Q defined on the same 
space (CI, J 7 ). We say that the probability measures P and 
Q are equivalent if the following condition is satisfied: 

Q(A) = P(A) = 0, for all A e T. (97) 

To get a better grasp on the meaning of the condition above, 
consider a random variable X [on (CI, T ', P)]. If Q is a prob- 
ability measure equivalent to P, then condition (97) implies 
that there exists a function p(X) such that expected values 
w.r.t Q are calculated in the following way 

E Q [g(X)\ = E P [p(X)g(X)], (98) 

where g(x) is an arbitrary function. Alternatively, we may 
write (98) in terms of probability densities: 

f Q (x) = p(x)f P (x), (99) 

where fp(x) denotes the probability density of X w.r.t the 
measure P and Jq(x) is the density w.r.t Q. (The function 
p(x) is called the Radon-Nikodym derivative of measure Q 
with respect to measure P [21].) 

Consider now the Brownian motion with drift 

W(t) = at+ W(t), 0<t<T (100) 

where a is some constant. (The finite-horizon condition 
t < T is a technicality that is not relevant for our pur- 
poses.) As already noted, W(t) is not a martingale (since 
its expected value is not constant). However, there is an 
equivalent probability measure Q, with respect to which the 
process W(t) becomes the standard Brownian motion (and 
hence a martingale). This result is known as Girsanov theo- 
rem. 

Theorem 1 (Girsanov theorem) The process W(t) given 
in (100) is a standard Brownian motion with respect to the 
probability measure Q defined by 

f Q (x,t) = M t (x)f P (x,t), (101) 

where M t is the process 

M t = cxp |-aWt - ^a 2 *| = exp |-oWi + ^ fl2 *} ■ 

(102) 

Proof. For a formal proof see, e.g., [8]. Here we shall only 
sketch a proof of the fact that the process W(t) is indeed 
distributed according to Af(Q, Vt), as the standard Brown- 
ian motion. First recall from (44) that the probability density 
fp(x, t) of W(t) under the original measure P is 

Mi , = J_« p {_(£_ial}. (10 3, 



Now according to (101) and (102) we have that 

f Q (x,t) = e- ai+1 * a2t fp(x,t). (104) 
Inserting (103) into (104) then yields 




(105) 



which is precisely the pdf for the standard Brownian motion. 
Q.E.D. 

One of the main applications of change of measures is to 
eliminate the drift in stochastic differential equations, so that 
with respect to the new measure Q the process is a martin- 
gale. The measure Q is then called an equivalent martingale 
measure. Constructing the equivalent martingale measure 
for an arbitrary SDE of the form (41) is a rather complicated 
procedure [5]. One important exception are linear SDE's 
where the Girsanov theorem gives the measure transforma- 
tion in an explicit form, as shown below. 

Consider the geometric Brownian motion discussed in 
Sec. 3.6. For technical reasons [8], let us restrict ourselves 
to its finite-horizon version: 

dS = pSdt + aSdW, t<T. (106) 

where p and a are positive constants. This equation can then 
be rewritten as 

dS = aS^dt + dW^j=aSdW, (107) 

where 

W t = (p/a)t + W t , t<T. (108) 

Now, according to Girsanov theorem, W t is a standard 
Brownian motion with respect to the measure Q given in 
(101) with a = p/a, and since the SDE (107) has no drift, 
its solution S t is a martingale w.r.t. the measure Q. 

5.3 The 'efficiency symmetry' and the no- 
arbitrage condition as its 'conservation 
law' 

The notion of an efficient market plays a crucial role in Fi- 
nance. Roughly speaking, in an efficient market all rele- 
vant information is already reflected in the prices [3]. This 
means, in particular, that past prices give no additional in- 
formation that is not already contained in the current price. 
In an efficient market prices thus adjust immediately to the 
arrival of new information. Since the content of future infor- 
mation and their effect on prices are unknown, it should be 
impossible to make definite predictions about future price 
based on the information available today. Thus, the best 
prediction for the expected future price (discounted to the 
present time) should be today's price. In other words, in 
a efficient market 'discounted prices' should be a martin- 
gale. This intuitive definition of efficiency is formalized be- 
low. [For technical reasons the results of this section will be 
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restricted to the discrete-time case. Their extension to the 
continuous-time framework, although possible to a large ex- 
tent, is more complicated and will not be attempted in the 
present notes; see, e.g., [5].] 

Consider a market formed by two assets (B,S), where 
B is our usual risk-free free asset and S is a risky asset. We 
suppose that S(t) follows a stochastic process on a proba- 
bility space (SI, T , P) endowed with a filtration {T t }t>o- 

Definition 8 Suppose the market (B, S) operates at dis- 
crete time t n , n = 1,2, .... This market is said to be ef- 
ficient if there exists (at least one) probability measure Q, 

S(t) 

equivalent to P, such that the 'discounted price' 



martingale with respect to the measure Q, that is, 



B(t) 



E Q 



B(t) 



to 



S(t ) 

B(t y 



for to < t, 



(109) 



where means expected value w.r.t. the measure Q. 

The requirement of efficiency, as defined above, is some- 
what reminiscent of a symmetry principle in Physics. In- 
deed, we can recast definition (109) by saying that in a effi- 
cient market there exists a 'special measure Q' with respect 
to which discounted prices are invariant under a sort of 'time 
translation,' in the following sense: 



E Q 



S(t + T) 
B(t + T) 



\Tt 



B(ty 



(HO) 



for any T > 0. 

In Physics, symmetry principles are intimately con- 
nected with conservation laws. (Recall, for instance, that 
the invariance of Newton's law under time translation im- 
plies conservation of energy.) It is thus only natural to 
ask whether the 'efficiency symmetry' above also leads to 
a 'conservation law.' Perhaps not surprisingly, this is indeed 
the case, as stated in the following theorem, which is some- 
times referred to as the First Fundamental Theorem of asset 
pricing. 

Theorem 2 Suppose the market (B, S) operates at discrete 
time t n , n = 1, 2, .... Then this market is efficient if and only 
if it is arbitrage-free. 

(See [5] for a proof.) 

Recall that absence of arbitrage means that any self- 
financing portfolio with zero initial value, and with no 
chance of becoming negative, will remain zero-valued for 
all subsequent times. More precisely, the no-arbitrage con- 
dition says that 



V(0) = and V(t) > a.s. 



V(t) 



a.s., 
(HI) 

where a.s. means almost surely, i.e., with probability 1. The 
absence of arbitrage can thus be interpreted as a kind of 
'conservation law' for the "vacuum state" of the market: if 
you start in a state with zero initial money and do not take 



any risks, then you remain at this state for all times. I thus 
find it quite interesting that the so-called "no-free-lunch" 
condition can actually be seen as the conservation law as- 
sociated with the efficiency symmetry. 

Another very important result is the Second Fundamen- 
tal Theorem of asset pricing, linking the completeness of a 
market to the uniqueness of its equivalent martingale mea- 
sure. 

Theorem 3 An arbitrage-free (B, S)-market is complete if 
and only if the equivalent martingale measure Q is unique. 

(See [5] for a proof.) 

We already know that the Black-Scholes model is com- 
plete. Below we will calculate its equivalent martingale 
measure explicitly, and it will become clear from the con- 
struction that it is indeed unique. 



5.4 Pricing derivatives with the equivalent 
martingale measure 



The notion of an equivalent m 
to price derivatives in a rather 
solve a PDE. The idea is that 
nancial assets are martingales 
martingale measure Q. More 
tingent claim with maturity T 
then from (109) we have 



artingale measure can be used 
direct way, without having to 
in an efficient economy all fi- 
with respect to the equivalent 
precisely, if F(S, t) is a con- 
and payoff function <&(S(T)) 



F(S,t) 
B(t) 



= e: 



Q 

t,s 



B(T) 



(112) 



where the subscripts t, S denote that the expected value is 
taken at present time t and with current value 5*, i.e., condi- 
tional to the information available at time t. It is not hard to 
convince oneself that if the derivative price F were not given 
by (1 10), then there would be an arbitrage opportunity. 

In the Black-Scholes model, the risk-free asset is a bank 
account with fixed interest rate r, i.e., B(t) = e rt , so that 
(112) becomes 



F(S,t) 



r(T-t) 



Egs MS(T))} , (113) 



or 



/•CO 

F(S,t)^e- r ^ T - t U 1>(S')f Q (S',T;S,t)dS', (114) 
Jo 

where Jq(S, t; So, to) denotes the probability density, under 
the equivalent martingale measure Q, of the process S(t) 
with initial value S(to) = So- All that remains to be done 
now is to find the equivalent martingale measure Q for the 
Black-Scholes model. To do this, consider the process 



Z(t) 



B(t) 



e - rt s(t). 



(115) 
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We then have 



which in terms of the process W(t) given in (116) reads 



where 



dZ = -re- rt Sdt + e- rt dS 
= (fi-r)Zdt + <jZdW 
= vZdW, 



W(t) = [(n-r)/a]t + W(t). 



(116) 



Now recall that in the Black-Scholes model the stock price 
follows a geometric Brownian motion 



dS = nSdt + aSdW, 



(117) 

I 



dS = rSdt + crSdW. 



(118) 



From Girsanov theorem we know that there is a equiv- 
alent martingale measure Q that turns W(t) into a Brown- 
ian motion. Equation (118) then shows that w.r.t the mea- 
sure Q the price S(t) follows a geometric Brownian motion 
with mean rate of return equal to r. The probability density 
/q(S",T; S, t) can now be obtained directly from (54), by 
simply setting fi = r and So = S. One then gets 



fn(S', T; S, t) = == exp < 

where r = T — t. Inserting (119) into (1 14) we obtain 

-r(T-t) 



In (f ) - (r - |a 2 ), 



2<T 2 1 



(119) 



F(t,S) = 



$C5') e [ln(57S)^(r-i<T 2 )(T-t)] 2 ^ 

^2wa 2 (T~t) Jo S' ' 



(120) 



which is precisely the expression obtained for the generic 
solution of the Black-Scholes equation given in (92). In the 
case of a European call option we have <5>(5') = max(S" — 
K, 0), which inserted into (120) yields, after some algebra, 
the Black-Scholes formula (84). 

It is interesting to notice that under the equivalent mar- 
tingale measure Q the stock price in the Black-Scholes 
model follows a geometric Brownian motion with the mean 
rate of return equal to the risk-free interest rate r; see (118). 
It is as if all investors were risk neutral, in the sense they 
would be willing to invest on a risky stock even though its 
expected return is just what a risk-free bank account would 
yield. For this reason, the pricing method based on the 
equivalent martingale measure is commonly referred to as 
risk neutral valuation. Of course, actual investors are not 
risk neutral. However, in an efficient market there is a 'spe- 
cial reference frame' where investors can be treated as if 
they were indeed insensitive to risk. 

6 Beyond the Standard Model of Fi- 
nance I: Non-Gaussian Distribu- 
tions 

We have seen above that the Black-Scholes model is an el- 
egant and powerful theoretical construct: it is complete, ef- 
ficient, arbitrage-free, and Gaussian (in the sense that the 
stock returns are normally distributed). It is thus important 
to ask whether real markets actually fit into this nice frame- 
work. 



There are two main ways in which real markets may de- 
viate from the standard Black-Scholes model: i) the returns 
may not be normally distributed or ii) there may exist long- 
memory effects on the time series. In this Section we will 
discuss the possibility that asset prices may follow a non- 
Gaussian stable Levy process, while in the next section we 
investigate whether financial data might exhibit long-term 
memory. 

Mandelbrot [22] in 1963 was perhaps the first person 
to challenge the paradigm that returns are normally dis- 
tributed. He analyzed cotton prices on various exchanges 
in the United States and found evidences that their distribu- 
tion of returns decays as a power law and hence much slower 
than a Gaussian. An example of this 'fat tail' behavior can 
be seen in Fig. 8, where I plot the distribution for the returns 
of the Ibovespa index. In this figure, it is also shown a Gaus- 
sian distribution with the variance of the data, which appears 
as a parabola in the linear-log scale of the graph. One clearly 
sees that the empirical distribution does indeed have 'fatter 
tails' when compared with the Gaussian distribution. 

The Gaussian distribution is special for two main rea- 
sons. The first one is the Central Limit Theorem [21] that 
states that the sum of infinitely many independent random 
variables (with finite variance) will converge to a Gaussian 
variable. The second one is the fact that it is a stable distri- 
bution, in the sense that the sum of two independent Gaus- 
sian random variables is also a Gaussian variable. It is thus 
natural to ask whether there are other stable distributions. 
The French mathematician Paul Levy showed that there is 
indeed a whole family of stable distributions of which the 
Gaussian is but one particular case. In what follows, I will 
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first introduce the so-called Levy stable distributions and 
then briefly discuss their possible applications to financial 
data. 




r 



Figure 8. Distribution of the daily Ibovespa returns (open circles). 
The solid line correspond to a Gaussian distribution with the same 
variance as that of the empirical distribution. 

6.1 The stable Levy distributions 

Let X be a random variable with probability density func- 
tion (pdf) given by p(x). We recall that the characteristic 
function ip(z) of a random variable X is the Fourier trans- 
form of its pdf p(x): 

/OO 
p{x)e lzx dx. (121) 
-oo 

Now let Xi and X 2 be two independent random variables 
with pdf's p\{x\) and p 2 (x 2 ). Since X\ and X 2 are inde- 
pendent, the pdf of the sum X = X\ + X 2 is the convolution 
of the original pdf's: 

/oo 
Pi(s)p 2 (x - s)ds. 
-oo 

Let now <p[z, 2) denote the characteristic function of X. 
In view of the convolution theorem which states that the 
Fourier transform of the convolution is the product of the 
Fourier transforms, it then follows that 

<p{z,2) = <p 1 {z)<p 2 {z)- (122) 

Suppose furthermore that X\ and X 2 are identically dis- 
tributed, that is, 

X!=X 2 , (123) 

or alternatively 

Vl (z) = tp 2 (z) = <p{z). (124) 

(Recall that the symbol = denotes equality in the distribu- 
tion sense.) From (122) and (124) it then follows that 

<p{z,2) = [<p{z)] 2 . 

In general, if X = Xi, wnere the JQ's are inde- 

pendent and identically distributed (i.i.d.) random variables, 
then 

<p{z,N) = [<p{z)] N , (125) 



from which the pdf of X can be obtained by calculating the 
inverse Fourier transform. Note that the pdf of the sum of 
N i.i.d. random variables will in general be quite different 
from the pdf of the individual variables. There is however 
a special class of distribution, the stable distributions, for 
which the pdf of the sum has the same functional form of 
the individual pdf's. 

Definition 9 A probability distribution p(x) is stable if for 
each N > 2 there exist numbers > and 6/v, such that, 
if Xi, Xn are i.i.d. random variables with distribution 
p{x), then 

X 1 + ... + X A r = a „X, + & Af . (126) 

In other words, a distribution is stable if its form is in- 
variant under addition, up to a rescaling of the variable by 
a translation and a dilation. More precisely, if p(x, N) de- 
notes the probability density of X = Xi, wnere the 
Xi's are i.i.d. variables with a stable distribution p(x), then 
(126) implies that 

p(x,N) = ±p(°^-). (127) 
cin \ a N J 

Stability of probability distributions has a nice interpretation 
from an economic standpoint. To see this, suppose that the 
variables Xi represent daily increments of a given financial 
asset. Stability then means preservation of the distribution 
under time aggregation, which is a rather natural property to 
expect from financial data. 

As already mentioned, the Gaussian distribution is sta- 
ble. To see this, recall that the Fourier transform of a Gaus- 
sian is a Gaussian and that the product of Gaussians is again 
a Gaussian, so that from (125) it follows that the characteris- 
tic function ip(z, N) will indeed be that of a Gaussian vari- 
able. More precisely, we have that the characteristic func- 
tion of a normal variable Af(0,a) is tp(z) = e~( CT / 2 ) z , 
which inserted into (125) immediately yields cp(z,N) = 
e~( JV<T / 2 ) z . Thus, the sum of N i.i.d. normal variables is 
normally distributed with standard deviation y/N a, that is, 

X = Af(0, \fN a), which in turn implies that 

p{x > n) = 7n p {tn)- (128) 

Thus, in the case of the Gaussian distribution we have ajy = 
l/y/N and b N = 0. 

The class of stable distributions is rather small and was 
completely determined by the mathematicians P. Levy and 
A. Ya. Khintchine in the 1920's. Here we shall restrict our- 
selves to the subclass of symmetric distribution. In this case, 
the characteristic function is given by 

<p a (z) = e-W , (129) 

where < a < 2 and a > 0. The parameter a is called the 
stability exponent and a is a scale factor. Taking the inverse 
Fourier transform of tp a {z) we obtain the corresponding pdf 

p a {x): 



1058 



Giovani L. Vasconcelos 



p a (x) = 



2tt 



i r 

71" JO 



cos(zx)dz. 



(130) 



Unfortunately, however, only for two particular values of a 
can the integral above be explicitly calculated: 

• a = 1 (Lorentzian or Cauchy distribution): 

2a 1 



p(x) 



it x 2 + 4ct 2 



-x 2 /4a 



• a = 2 (Gaussian distribution): 

p{x) = sL e 

Note also that Levy distributions are not defined for 
a > 2, because in this case the function obtained from (130) 
is not everywhere positive. 

Although, for arbitrary a the pdf p a (x) cannot be found 
in closed form, its asymptotic behavior for large x can be 
easily calculated from (130). Here one finds [5] that 

C 

p a (x) k, | ° a , |a;| -> oo, (131) 



where 



a r(i 

7T 



ira 



(132) 



We thus see that the Levy distribution with a < 2 has the 
interesting property that it shows scaling behavior for large 
x, i.e., p(x) decays as a power-law. 

Another important scaling relation for the Levy distri- 
bution can be obtained, as follows. First note that for sym- 
metric stable distribution we necessarily have = 0. Now 
using (125) and (129), we easily find that the dilation factor 
ajv in (127) is 

a N = TV 1 /", (133) 

so that (127) becomes 



p a (x, TV) = 
which implies that 

p(0,N) 



Pc 



(TV 1 /^) 



TV 1 /" 



P(0) 

TV 1 /" 



(134) 



(135) 



One can then use this scaling relation to estimate the index a 
of the Levy distribution: in a log-log plot of p(0, TV) against 
TV, the slope of a linear fit gives precisely 1 /a; see Sec. 6.3 
below. 

The power-law decay of Levy distributions implies, of 
course, the absence of a characteristic scale. The downside 
of this is that all Levy distributions have infinite variance! 
In fact, all moments of order higher than 1 are infinite, since 
£7[|a;| n ] diverges for n > a, as can be readily shown from 
(131). Processes with infinite variance are not physically 
plausible, so several prescriptions to truncate the Levy dis- 
tribution at some large scale has been proposed, as discussed 
next. 



r 



6.2 Truncated Levy distributions 

To circumvent the problem of infinite variance in Levy dis- 
tributions, several truncation prescriptions have been pro- 
posed in the literature. In a general they can be written as 



p(x) =p a (x)®(x), 



(136) 



where <&(ar) is a cut-off function to be chosen in such way 
that the variance of the truncated distribution is finite. For 
example, two possible choices that have been used to model 
the distributions of financial asset prices are given below 

• Abruptly truncated Levy distribution (ATLD): 

$(x) = Q(x c -\x\), 

where Q(x) is the Heaviside function and x c is some cut-off 
length scale. 

• Exponentially truncated Levy distribution (ETLD): 

$(sc) = Ae~ x W, 

where A > and A is a normalization factor. 

Other variants of truncated Levy distributions that have 
also been considered in the literature are the gradually trun- 
cated Levy distribution [23] and the exponentially damped 
Levy distributions [24]. 

Since a truncated Levy distribution has finite variance, 
then by the central limit theorem the distribution of the sum 
X = X\ + ... + Xn of TV i.i.d variables with such a dis- 
tribution will converge to a Gaussian distribution for large 
TV. However, this convergence is usually very slow — for fi- 
nancial data it is typically in the order of tens of days; see 
below. For shorter time scales, non-Gaussian behavior may 
thus be of practical relevance. 



6.3 Levy distributions in Finance 

Levy distribution have been used, for example, by Man- 
tegna & Stanley [9] to model the distribution of changes in 
the stock index S&P500 of the American Stock Exchange. 
They analyzed high-frequency data (one-minute quotes of 
the S&P500) over the period from January 1984 to Decem- 
ber 1989. From the original time series Y(t) of the index 
values, they first generated time series corresponding to in- 
dex changes during intervals of TV minutes: 



Z N (t)=Y(t + N)-Y(t). 



(137) 
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They then computed the empirical pdf p(z, N) and analyzed 
the scaling of p(0,N) with N. In a log-log plot p(0,N) 
showed a linear behavior, as predicted by (135), with a slope 
corresponding to a = 1.4 for 30 < AT < 1000 minutes [9]. 
For N > 10 4 the slope of p(0, N) approaches —0.5, in- 
dicating convergence to a Gaussian behavior. In the Levy 
regime (i.e., small N), however, the tail of their empirical 
pdf decays slower than Gaussian but faster than a pure Levy 
distribution with the exponent a found from above scaling 
argument. These facts thus suggest that a truncated Levy 
distribution would perhaps be more appropriate for model- 
ing the actual distribution. Indeed, Bouchaud and Potters 
[10] found that the probability of 15-minute changes of the 
S&P500 index is well described by a ETLD with a = 1.5. 

The ETLD has also been applied by Miranda & Riera 
[25] to study the daily returns of Ibovespa index of the Sao 
Paulo Stock Exchange in the period 1986-2000. From the 
daily closing values Y(t) of the Ibovespa, they first calcu- 
lated the time series for the returns in intervals of TV days 

r N (t)=logY(t + N)-logY(t), (138) 

and then computed the corresponding pdf's for p(r,N). 
From the scaling of p(0, N) they found a ~ 1.6 — 1.7 for 
N < 20 days, whereas for larger N a Gaussian-like behav- 
ior (i.e., a = 0.5) was observed. 

Many other applications of Levy processes in Finance 
have been discussed in the literature [26]. For example, a 
model for option pricing has recently been considered where 
the price of the underlying asset is assumed to follow a trun- 
cated Levy process [27]. More recently, there have accu- 
mulated evidences [28, 29, 30] that in certain cases financial 
data may be better described by exponential distributions, 
rather than by Levy or Gaussian distributions. 

7 Beyond the Standard Model of Fi- 
nance II: Long-Range Correlations 

In this section, we discuss the possibility that asset prices 
might exhibit long-range correlations and thus may need to 
be described in terms of long-memory processes, such as the 
fractional Brownian motion. 

7.1 Fractional Brownian motion 

The fractional Brownian motion (FBM) is a Gaussian pro- 
cess {Wn{t),t > 0} with zero mean and stationary incre- 
ments, whose variance and covariance are given by 

E[W 2 H {t)] = t 2H , (139) 

E[W H (s)W H (t)] = l - (s 2H + t 2H -\t- s\ 2H ) , (140) 

where < H < 1. The FBM W H (t) is a self-similar pro- 
cess, in the sense that 

W H (at) = a H W H {t), (141) 



for all a > 0. A sample path of a FBM is therefore a frac- 
tal curve with fractal dimension D = l/H. The param- 
eter H is called the self-similarity exponent or the Hurst 
exponent. For H = 1/2 the process Wnit) corresponds 
to the usual Brownian motion, in which case the increments 
Xt = Wfr(i+1) — Wjy(i) are statistically independent, cor- 
responding to white noise. On the other hand, forH^ 1/2 
the increments X t , known as fractional white noise, display 
long-range correlation in the sense that 

E[X t+h X t ] ~ 2H(2H - l)h 2H - 2 for h -» oo, (142) 

as one can easily verify from (139) and (140). Thus, if 
1/2 < H < 1 the increments of the FBM are positively 
correlated and we say that the process ff/f(t) exhibits per- 
sistence. Likewise, for < H < 1/2 the increments are 
negatively correlated and the FBM is said to show antiper- 
sistence. Sample FBM paths with H = 0.2, 0.5, and 0.8 are 
shown in Fig. 9. 




Figure 9. Sample paths of fractional Brownian motion. 



Several estimators for the exponent H have been dis- 
cussed in the literature; see, e.g., Ref. [31] for a comparison 
among some of them. One general methodology consists 
in estimating how the 'amount of fluctuation' within a time 
window of size r scales with r. Specific methods, such as 
the Hurst rescaled range (R/S) analysis [32] or the Detren- 
dend Fluctuation Analysis [33, 34], differ basically on the 
choice of the fluctuation measure. Here I shall discuss only 
the DFA that has proven to be a more reliable estimator for 
H than the Hurst R/S analysis [35]. 

7.2 Detrended fluctuation analysis 

Suppose we have a time series r(t), t = 1,...,T, corre- 
sponding to, say, daily returns of a financial asset. To im- 
plement the DFA, we first integrate the original time series 
r(t) to obtain the cumulative time series X(t): 

t 

X(t) = J2(r(t')-f), t = l,...,T, (143) 

i'=l 
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where 



(144) 



Next we break up X(t) into N non-overlapping time inter- 
vals, /„, of equal size r, where n — 0, 1, ...,N — 1 and N 
corresponds to the integer part of T/t. We then introduce 
the local trend function Y T (t) defined by 



Y T (t) = a n + b n t for t € I n 



(145) 



where the coefficients a n and b n represent the least-square 
linear fit of X(t) in the interval I n . Finally, we compute the 
rescaled fluctuation function F(t) defined as [35] 



F(t) = 



N 



Nt 

J2iX(t)-Y T (t)} 2 , (146) 

TIT 



t=l 



where S is the data standard deviation 



\ 



1 



(147) 



The Hurst exponent H is then obtained from the scaling be- 
havior of F(t): 



F(t) = Ct 



H 



(148) 



where C is a constant independent of the time lag r. 

In a double-logarithmic plot the relationship (148) yields 
a straight line whose slope is precisely the exponent H, and 
so a linear regression of the empirical F(t) will immedi- 
ately give H. One practical problem with this method, how- 
ever, is that the values obtained for H are somewhat depen- 
dent on the choice of the interval within which to perform 
the linear fit [35, 36]. It is possible to avoid part of this diffi- 
culty by relying on the fact that for the fractional Brownian 
motion, the fluctuation function F(t) can be computed ex- 
actly [31]: 

F h (t) = C h t h , (149) 



where 



C 



H 



2 

2H+1 



1 

H + 2 



2 

H+l 



1/2 



(150) 



In (149) we have added a subscript H to the function F to 
denote explicitly that it refers to Wjj(i). Equation (149) 
with (150) now gives a one -parameter estimator for the ex- 
ponent H: one has simply to adjust H so as to obtain the 
best agreement between the theoretical curve predicted by 
Fh(t) and the empirical data for F(t). 

7.3 Fractional Brownian motion in Finance 

The idea of using the FMB for modeling asset price dy- 
namics dates back to the work of Mandelbrot & van Ness 
[37]. Since then, the Hurst exponent has been calculated 
(using different estimators) for many financial time series, 
such as stock prices, stock indexes and currency exchange 
rates [38, 39, 40, 41, 35]. In many cases [38] an exponent 



H > 1/2 has been found, indicating the existence of long- 
range correlation (persistence) in the data. It is to be noted, 
however, that the values of H computed using the traditional 
R/S-analysis, such as those quoted in [38], should be viewed 
with some caution, for this method has been shown [35] to 
overestimate the value of H. In this sense, the DFA appears 
to give a more reliable estimates for H . 

An example of the DFA applied to the returns of the 
Ibovespa stock index is shown in Fig. 10 (upper curve). In 
this figure the upper straight line corresponds to the theo- 
retical curve F h (t) given in (149) with H = 0.6, and one 
sees an excellent agreement with the empirical data up to 
r ~ 130 days. The fact that H > 0.5 thus indicates persis- 
tence in the Ibovespa returns. For r > 130 the data deviate 
from the initial scaling behavior and cross over to a regime 
with a slope closer to 1 /2, meaning that the Ibovespa looses 
its 'memory' after a period of about 6 months. Also shown 
in Fig. 10 is the corresponding F(t) calculated for the shuf- 
fled Ibovespa returns. In this case we obtain an almost per- 
fect scaling with H = 1/2, as expected, since the shuffling 
procedure tends to destroys any previously existing correla- 
tion. 







H=0.6 / 


io' 












F(x) 


ibovespa ^ 






shuffled data 


10° 







Figure 10. Fluctuation function F(j) as a function of r for the re- 
turns of the Ibovespa index (upper curve) and for the shuffled data 
(lower curve). The upper (lower) straight line gives the theoretical 
curve Fh(t ) for H = 0.6 (H = 1/2). 

As already mentioned, the Hurst exponent has been 
calculated for many financial time series. In the case of 
stock indexes, the following interesting picture appears to be 
emerging from recent studies [40, 41, 35]: large and more 
developed markets, such as the New York and the London 
Stock Exchanges, usually have H equal to (or slightly less 
than) 1/2, whereas less developed markets show a tendency 
to have H > 1/2. In other words, large markets seem in- 
deed to be 'efficient' in the sense that H ~ 1/2, whereas 
less developed markets tend to exhibit long-range correla- 
tion. A possible interpretation for this finding is that smaller 
markets are conceivably more prone to 'correlated fluctua- 
tions' and perhaps more susceptible to being pushed around 
by aggressive investors, which may explain in part a Hurst 
exponent greater than 1/2. 
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It should also be pointed out that a considerable time- 
variability of the exponent H for stock indexes has been 
found [41, 35], indicating that the data in such cases can- 
not be modeled in terms of stationary stochastic processes. 
(A time-varying H has also been observed in other financial 
data, such as, currency exchange rate [39].) In such cases, 
the stationarity assumption is only a rather crude approxi- 
mation [35]. Furthermore, the time dependence of the Hurst 
exponent is an indication that the underlying process might 
be multifractal rather than monofractal; see below. 



0.9 1 | i i i | i i i | i i i | i i i | i i i | i i i | i i i | i i i | 




°' 3 1968 1972 1976 1980 1984 1988 1992 1996 2000 

t 

Figure 11. The Hurst exponent H for the Ibovespa as a function of 
time. Here H was computed in two-year time intervals, with the 
variable t denoting the origin of each such interval. 

A time-varying Hurst exponent has been observed for 
the Brazilian stock market. This case is of particular inter- 
est because in the recent past Brazil was plagued by run- 
away inflation and endured several ill-fated economic plans 
designed to control it. To analyze the effect of inflation and 
the economic plans, Costa and Vasconcelos [35] calculated a 
time-varying Hurst exponent for the Ibovespa returns, com- 
puted in three-year time windows for the period 1968-2001. 
A similar analysis but with two-year time windows is shown 
in Fig. 11. One sees from this figure that during the 1970's 
and 1980's the curve H(t) stays well above 1/2, the only ex- 
ception to this trend occurring around the year 1986 when 
H dips momentarily towards 1/2 — an effect caused by the 
launch of the Cruzado economic Plan in February 1986 [35]. 
In the early 1990's, after the launching of the Collor Plan, we 
observe a dramatic decline in the curve H (t) towards 1/2, 
after which it remained (within some fluctuation) around 
1/2. This fact has led Costa and Vasconcelos [35] to con- 
clude that the opening and consequent modernization of the 
Brazilian economy that begun with the Collor Plan resulted 
in a more efficient stock market, in the sense that H ~ 0.5 
after 1990. In Fig. 11, one clearly sees that after the launch- 
ing of a new major economic plan, such as the Cruzado Plan 
in 1986 and the Collor Plan in 1990, the Hurst exponent de- 
creases. This effect has, of course, a simple economic inter- 



pretation: a Government intervention on the market is usu- 
ally designed to introduce "anti-persistent effects," which in 
turn leads to a momentary reduction of H. Note also that 
only after 1990 does the curves H(t) goes below 1/2. This 
finding confirms the scenario described above that more de- 
veloped markets (as Brazil became after 1990) tend to have 

if £ 1/2. 

7.4 Option pricing under the FBM assump- 
tion 

We have seen above that often times asset prices have a 
Hurst exponent different from 1/2. In such cases, the stan- 
dard Black-Scholes model does not apply, since it assumes 
that returns follow a Brownian motion (H = 0.5). A more 
appropriate model for the return dynamics would be the 
fractional Brownian motion (FBM). Indeed, a 'fractional 
Black-Scholes model' has been formulated, in which it is as- 
sumed that the stock price S follows a geometric fractional 
Brownian motion given by 

dS = fj.Sdt + <rSdW H , (151) 

where Wn{t) is the standard FBM. The fractional stochas- 
tic differential equation above is shorthand for the integral 
equation 

S(t) = S(0)+n f S{t')dt'+a [ S{t')dW H {t'). (152) 
Jo Jo 

To make mathematical sense of this equation is, of course, 
necessary to define stochastic integrals with respect to 
Wait). Here it suffices to say that a fractional Ito calcu- 
lus can indeed be rigorously defined [42] that shares (in an 
appropriate sense) many of the properties of the usual Ito 
calculus. In the context of this fractional Ito calculus is pos- 
sible to prove that the solution to (151) is given by 

S(t) = S(0) cxp j/rf - \o 2 t 2H + o-W H (t) J . (153) 

Compare this expression with (53). 

One can show [42] that the fractional Black-Scholes 
model is complete and arbitrage-free. To price derivatives 
with this model, one can apply the same A-hedging argu- 
ment used before, which now leads to the 'fractional Black- 
Scholes equation.' The result is summarized in the following 
theorem. 

Theorem 4 In the fractional Black-Scholes model, the 
price F(S, t) of a European contingent claim with payoff 
<J>(S(T)) is given by the solution to the following boundary- 
value problem 

g + +rS «*_ rP _0, 054) 

F(T,S) = $(S). (155) 
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[Compare with (88).] 

For the case of a European call option the solution to 
the problem above can be found in closed form. Alterna- 
tively, one can obtain the option pricing formula directly 
from the equivalent martingale measure, without having to 
solve the above PDE. The final result for this 'fractional 
Black-Scholes formula' is given below [43]. 

Theorem 5 In the fractional Black-Scholes model, the price 
of a European call option with strike price K and maturity 
T is given by 



C{S,t) = SJV(di) - Ke 



-r(T-t) 



N(d 2 ), 



where 
di 



In (f)+r(T-t) + \a 2 (T 2H -t 2H ) 



2H 



t 2H 



\n{^)+r{T-t)-\a 2 {T 2H 
aVT 2H - t 2H 



t 2H ) 



(156) 

(157) 
(158) 
(159) 



[Compare with (84).] 

The fractional Black-Scholes formula has been applied 
to price some options traded on the Brazilian market [44]. 
Here, however, the option prices obtained with the formula 
above resulted considerably higher than those from the usual 
Black-Scholes formula. The practical relevance of the frac- 
tional Black-Scholes model to real markets thus needs to be 
investigated further. 

7.5 Multifractality in Finance 

The fact that the Hurst exponents of financial data often dis- 
play considerable variability in time indicates, as already 
mentioned, that such time series cannot be satisfactorily 
modeled in terms of a fractional Brownian motion, which is 
characterized by a constant H and would thus capture only 
a sort of average behavior of the actual price dynamics [35]. 
In such cases, it would be more appropriate to model the 
data as a multifractal process. 

The notion of a multifractal was first introduced in the 
context of dynamical systems to describe physical processes 
taking place on a fractal support [45]. A rigorous exposition 
of multifractal measures is beyond the scope of the present 
notes, and so we will content ourselves with a rather intu- 
itive description of multifractality. The basic idea here is 
that a monofractal process, such as the FBM, is character- 
ized by a single exponent H, whereas for a multifractal a 
whole family (spectrum) of exponents is necessary, one for 
each moment of the distribution. 

We have seen above that the FBM is a Gaussian process 
whose standard deviation scales with time as 



yjE[W*(t)] = t H . 



(160) 



If we now introduce the generalized Hurst exponents H q as 
the corresponding scaling exponent for the 2q-th moment, 
that is, 

[E[w 2 H \t)]) 1/2q = C q t H % (161) 



where C q is a constant, it then immediately follows from 
property (19) of the Gaussian distribution that 



H q = H. 



(162) 



That is, all higher-order Hurst exponents of the FBM are 
equal to H itself and hence the FBM is said to be a 
monofractal. Our working definition of a multifractal will 
then be a process for which the generalized Hurst exponents 
H q vary with q, or alternatively, that the quantity qH q does 
not scale linearly with q. 

In general, any method used to calculate the Hurst ex- 
ponent H (see Sec. 7.1) can be adapted to obtain the gen- 
eralized exponents H q . For example, the multifractal gen- 
eralization of the DFA consists in calculating the gth-order 
fluctuation function F q (r), 



F q (r) = 



l/2q 



(163) 



In complete analogy with (148), the exponents H q are then 
obtained from the scaling 



F q (T)=CqT H <-. 



(164) 



[We remark parenthetically that the multifractal DFA de- 
fined above is slightly different from the formulation intro- 
duced in Ref. [46], but such minor distinctions do not matter 
in practice.] 

As an illustration of the multifractal DFA, we have ap- 
plied this method to the Ibovespa returns. For each q we 
computed the function F q (r) defined in (164), plotted it in 
a double-logarithmic scale, and obtained the generalized ex- 
ponent H q from the slope of the curve. In Fig. 12 it is plotted 
the resulting quantity qH q as a function of q. In this figure 
we clearly see that qH q deviates from the linear behavior ex- 
pected for a monofractal, thus indicating that the time series 
of Ibovespa returns does indeed display multifractal behav- 
ior. Evidences of multifractal behavior have been seen in 
several other stock indexes [41]. 

Multifractality has also been observed in many time se- 
ries for currency exchange rates, and this has motivated the 
suggestion that there might perhaps be a formal analogy be- 
tween turbulent flows and foreign currency exchange mar- 
kets; see Ref. [1 1] for references to the original literature. In 
turbulence [47], there is an energy cascade from the large 
scales (where energy is fed into the system) down to the 
small scales (where energy is dissipated by viscosity). In 
currency markets the analog would be a kind of information 
cascade in time from long-term investors (that lake a longer 
view of the market) to short-term investors (that watch the 
market very frequently) [11]. 
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Figure 12. The generalized Hurst exponents H q as a function of q 
for the Ibovespa. The dashed line indicates the linear behavior of a 
monofractal FBM with H = 0.6. 



A Some Basic Definitions from Prob- 
ability Theory 

A.l Probability space 

In probability theory one usually imagines performing an 
experiment in which chance intervenes. The occurrence or 
nonoccurrence of such an experiment is called an outcome 
ui. The set of all possible elementary outcomes is denoted 
by Ct. 

An event A is a set of outcomes, i.e., a subset of Ct. We 
are interested in attributing a probability to events in CI. If Ct 
is finite or countable, we could introduce a probability P(uo) 
for each individual outcome u> and then define the probabil- 
ity P(A) of an event A as the sum of the probabilities of all 
outcomes that make up the event A: 

p(A) = J2pH- 



8 Conclusions 

In these notes, I tried to present a basic introduction to an in- 
terdisciplinary area that has become known, at least among 
physicists working on the field, as Econophysics. I started 
out by giving some basic notions about financial derivatives 
and illustrated how to price them with a simple binomial 
model. After this motivational introduction, I offered a con- 
cise description of Brownian motion and stochastic calculus, 
which provide the necessary mathematical tools to describe 
financial asset prices in continuous time. I then proceeded 
to discuss the Standard Model of Finance (SMF), namely, 
the Black-Scholes model for pricing financial derivatives. 
The formulation of the Efficient Market Hypothesis, which 
lies at the heart of the SMF, in terms of martingales and its 
consequences for pricing derivatives were also discussed. 
Finally, I briefly reviewed some recent work done mostly, 
but not exclusively, by physicists that have produced evi- 
dences that the SMF may not fully describe real markets. In 
this context, some possible extensions of the Black-Scholes 
model were considered. 

I should like to conclude by mentioning that other al- 
ternatives approaches to the problem of pricing financial 
derivatives have been proposed by physicists, using meth- 
ods originally developed to treat physical problems. For in- 
stance, the option pricing problem was recently discussed 
in the context of the so-called non-extensive statistical me- 
chanics [48]. A "Hamiltonian formulation" for this problem 
was also given in which the resulting "generalized Black- 
Scholes" equation is formally solved in terms of path inte- 
grals [49]. We refer the reader to Ref. [11] for a brief review 
of these and other recent developments in Econophysics. 
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This procedure, however, will not work when Ct is uncount- 
able, i.e., CI is a continuous space such as R, since in this 
case the probability of any particular outcome is zero. Fur- 
thermore, a typical event will have uncountably many (i.e. 
a continuum of) outcomes. Hence the formula above is not 
applicable. That's why we need the notion of a probability 
measure to be defined shortly. 

To do this, first we need to specify the class of 'observ- 
able events', i.e., the subsets of Ct to which a probability 
can be associated. If Ct is finite or countable, a partition of CI 
would be the natural candidate. (Recall that a partition {Ai} 
of a set Ct is a collection of disjoint subsets, i.e., Ai C Ct and 
Ai fl Aj = for i 7^ j, whose union covers the whole set 
Ct, i.e., \J i Ai = Ct.) In the case of a continuous space this 
is not possible and a different class of subsets is in order. 
To be useful, such a class must be closed under the various 
set operations, such as union, intersection, complementarity, 
etc. This is done through the concept of a cr-algebra. 

Definition 10 A family T of subsets ofCt is a a-algebra on 
Ct if the following conditions are fulfilled: 

1. e J 7 andCteT 

2. A e T => A c e T, where A c = Ct \ A is the 
complement of A in Ct 

oo 

3. A 1 ,A 2 ,...eT=^ \jA,eT 

i=l 

The par (Ct, F) is called a measurable space. 

The elements A C T of the cr-algebra T are called mea- 
surable sets or simply events. The idea here is that for a 
given set A e T it is possible to ascertain whether any out- 
come w belongs or not to A, and in this sense the event A is 
observable. 

The smallest cr-algebra consists of the empty set and 
the set Ct itself, i.e., JF m i n = {0, Ct}. The largest cr-algebra, 



1064 



Giovani L. Vasconcelos 



on the other hand, is made up of all subsets of CI, which 
is known as the power set of CI and denoted by 2°, hence 
f max = 2°. Intermediate cr-algebras can be generated in 
the following way. Start with a given family U of subsets of 
CI and form the intersection of all cr-algebras that contain U: 

T u = [\{T \TdU}. (165) 

In other words, Tu is the smallest algebra that contains U as 
is called the algebra generated by U. 

We can now attribute a 'probability of occurrence' to 
events A G T via a probability measure on (CI, T). 

Definition 11 A probability measure P on the measurable 
space (CI, T) is a function P : T — > [0,1] such that 

1. P(0) = 0andP(Cl) = 1 

2. If A\, A2, ... G J 7 is a disjoint collection of elements 
ofT, i.e., A t UAj = if 'i ^ j, then P(U^Ai) = 

The triple (Cl, T, P) is called a probability space. 

A.2 Random variables 

Intuitively, a random variable X is a function that attributes 
to each outcome uo a real number x, i.e., X(u>) — x. We 
usually think of X as a random number whose value is de- 
termined by the outcome ui. A more formal definition is 
given in terms of measurable functions. 

Definition 12 Let (CI, T, P) be a probability space. A func- 
tion f : CI — > R is measurable with respect to the a-algebra 
T, or more compactly, T -measurable, if 

f-\U) = {coe Cl\f(uj) eU}eT, 

for all open sets U G R. 

The definition above means that for any given interval 
(a, b) C R there is a meaningful event A in CI. In an abuse 
of language we usually refer to this event as A — {a < f < 
b}. 

Definition 13 A random variable X on a probability space 
(CI, T, P) is a T -measurable function. 

A random variable X naturally generates a cr-algebra. 
This is the algebra generated by all the sets X~ 1 {U'), U C 
R open, and is denoted Tx- We think of Tx as representing 
the 'information' generated by the random variable X. The 
cr-algebra Tx contains the essential information about the 
structure of the random variable X; it contains all sets of the 
form {uj\a < X(uj) < b}. 

We also recall the definition of the probability distribu- 
tion F(x) of a random variable X: 

F(x) = P(X < x), for x G R. (166) 



Random variables can be either discrete, if the only assume 
a finite or countably number of values x\, X2, or continu- 
ous. Most continuous distributions of interest have a density 
f(x), i.e., a function f(x) such that 

F(x) = ( f(x)dx. (167) 

J — 00 

This allows us to compute the probability of a given event 
A = {a < X < b} explicitly through the formula 

P(A) = j f(x)dx. (168) 

J a 

A.3 Stochastic processes 

Intuitively, a stochastic process represents a dynamical sys- 
tem which evolve probabilistically in time. A formal defini- 
tion is given below. 

Definition 14 A stochastic process is a collection of random 
variables 

{x t } teT , 

defined on some probability space (CI, T, P) and 
parametrized by the variable t. 

We usually think of the label t as being time, so that X t 
would represent the (random) value of the quantity X, say, 
the price of a risky asset or the position of a Brownian parti- 
cle, at time t. For most of the cases, we consider T to be the 
halfline [0, 00). In this case we have a continuous-time pro- 
cess. Eventually, we shall also consider discrete-time pro- 
cesses, in which case the variable t assumes (non-negative) 
integer values, i.e., T = {0, 1,2, ...}. 

It is perhaps worth emphasizing that a stochastic pro- 
cess is a function of two variables. For a fixed time t, X t 
is a function of the random variable u>, i.e., X t = X t (uj). 
For a fixed outcome ui G CI, it is a function of time, 
Xt = X t (u>), t G T. This function of time is called a 
realization, path, or trajectory of the stochastic process X t . 
Note, in particular, that in this context an outcome corre- 
sponds an entire realization or trajectory of the stochastic 
process X. 

A stochastic process is usually described in terms of 
the distributions it induces. The finite-dimensional distri- 
butions of the stochastic process X t are the joint probabil- 
ity distributions p(x\ , t\; x n , t n ) of the random variables 
X tl X tn , for all possible choices of times t\,...,t n G T 
and every n > 1. The finite-dimensional distributions de- 
termine many (but not all) relevant properties of a stochastic 
process. A special class of stochastic processes are the sta- 
tionary ones. 

Definition 15 A stochastic process is said to be stationary 
if all its finite-dimensional distributions are invariant under 
a time translation, that is, 

p(x 1 ,t 1 + t; ...,x n ,t n + r) =p(x 1 ,t 1 ; ...,x n ,t n ), 

for any r > 0. 

Another important class of stochastic process are Gaus- 
sian processes, where all finite-dimensional distributions are 
(multivariate) Gaussians. 
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